Journal o rf thee 


AMERICAN STATISTICAL 


ASSOCIATION 


JUNE 1960 


A Multiplicative Model for Anal: Variances which are Affected by Several 


Changes in the Rate and Miata of Household Formation ......-.-...-+ 


A Statistical Investigation of the Industrialization Controversy ..........+..++. 


Optimal Properties of Exponentially Weighted Forecasts ........ John F. Muth 


Large-Sample Covariance Analysis when the Control Variable is Fallible ...... 


Variance of the Median of Samples from a Cauchy Distribution .. . Paul R. Rider 
Tests of the Hypothesis that a Linear Regression System Obeys Two Separate 


A Note on Average Tau as a Measure of Concordance ........ William L. Hays 
Estimation in the Truncated Poisson Distribution when Zeros and Some Ones are 


SUMMARIES OF PAPERS DELIVERED AT 119TH ANNUAL MEETING . 


BOOK REVIEWS by George W. Barclay, V. Lewis Bassie, E. F. Beach, Patrick ; 


Billi y, Mark S. Blumberg, Colin R. Blyth, Hollis B. Chenery, Francis G. 
Cornell, B. E. Ellison, George E. Forsythe, D. Gale Johnson, Evelyn M. 
Kitagawa, Irving Kravis, G. J. Lieberman, Clarence D. Long, Lionel Mc- 
Kenzie, William G. Madow, M. W. Reder, Henry S. Shryock, Ltr, Rosedith 
Sitgreaves, Richard Stone, Herman Wold ...............-.6e0-eeeeues 


aa at BOARD OF DIRECTORS AND SECRETARY-TREASURER 


FE. 


VOLUME55 NUMBER 290 


v 

400 


American Statistical Association 


Organized November 27, 1889, Incorporated 1841 


The American Statistical Association is a scientific and educational organi- 
zation, Its membership is not confined to professional statisticians but includes 
economists, business executives, research directors, government officials, uni- 
versity professors, and others who are seriously interested in the application of 
statistical methods to practical problems, in the development of more useful 
methods, and in the improvement of basic statistical data. Engineers, mathe- 
maticians, biologists, actuaries, sociologists, psychologists, and representatives of 
many other professions are included in the membership of the Association. 


Annvat Dvugs* 
Residents of North America.... $ 8 Institutional membership, mini- 


5 

Student membership 6 
Introductory membership (first 
dues payment of members un- 


Member’s subscription to The 


* Includes subscription to the Journal of the American Statistical Association and to The Ameri- 


can Statistician. 


Subscription rate: $8 per year. Prices for back issues available on request. 
For further information about the Association and membership application 
forms, write the Secretary, 1757 K 8t., N.W., Washington 6, D. C. 


The present Institutional Members are: 


American Telephone & 
Telegraph Company 

Armour & Company 

Armstrong Cork Company 

Bankers Trust Company 

Bell Telephone Laboratories 

California Texas Oil Company, Ltd. 

Chrysler Corporation 

Deere & Company 

Dun and Bradstreet, Inc. 

Eastman Kedak 

The Equitable Life Assurance 
Society of the U. 8. 

Ford Motor Company 

General Dynamics Corporation 

Genera! Motors Co: tion 

Humble Oil and Refining Co. 

International Brotherhood of Team- 
sters, Chauffers, Warehousemen & 
Helpers of America 

International Business 
Machines Corp 

International Harvester Company 


International Ladies Garment 

etro tan e Insurance 
National Analysts, Inc. 
National Association of Manufacturers 
National Industrial Conference 


New York Telephone Co 

ew Yor ephone Compan 
Philadelphia Electric 
Schering Corporation 

Sears Roebuck & Compan 

Socony Mobil Oil Company, Inc, 


Standard Oil Company of 
Statistical 
Sun Oil Co pany 

Touche & Smart 
Union Carbide Corporation 
United States Steel 

The Wellington Fund, Ine. 
Western Electric Company, I 
Young & Rubicam, 


ew 


tion 


. 
Be 100 
5 
6 = 
iia der 30 years old)............ 5 


JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


The Editors welcome the submission of manuscripts for possible publication. They 
should be typewritten entirely double-spaced, including footnotes, and two copies should 
be sent to the Editor, Clifford Hildreth, Wells Hall, Section A, Michigan State University, 
East Lansing, Michigan. Books for review should be sent to the same address. Unsolicited 
book reviews are not accepted, but suggestions of titles for review are welcome. 


EDITOR 
Currrorp Hitpreta, Michigan State University 


ASSISTANT TO THE EpIToR 
Dorotuy B. BoEDECKER 


ASSOCIATE EDITORS 


RosBertT FERBER Marc NERLOVE 
University of Illinois University of Minnesota 


G. Mapow HERBERT SOLOMON 
Stanford Research Institute Stanford University 


ADVISORY PANEL OF FORMER EDITORS 


G. Cocuran (1945-50) Freperick F. StepHan (1935-40) 
Harvard University Princeton University 


FraAnkK A. Ross (1926-34, 41-5) W. ALLEN WALLIs (1950-60) 
Thetford, Vermont University of Chicago 


LIAISON EDITORS FROM SECTIONS 


Biometrics: Jerome Cornfield, Johns Hopkins University, Baltimore, Mary- 
land 

PHYSICAL AND ENGINEERING Sciences: Irving Burr, Purdue University, 
Lafayette, Indiana 

TRAINING OF StaTisTiciANs: John S. Freund, Arizona State College, Tempe, 
Arizona 

Liaison EpIToRS FROM THE BusINESS AND Economic Statistics SECTION 
AND Socrat Sratistics SECTION TO BE ANNOUNCED 

Corrigenda: Readers and authors are urged to submit to the Editor notices of errors 

found in this or any previous volume. These will be published once a year, in the 

December issue. 


Published Quarterly by the AMERICAN STATISTICAL ASSOCIATION 


Publication Office: Curtis Reed Plaza, Menasha, Wisconsin. Editorial ORes: Wells Hall, Section A, Michigan State 
University, East Lansing, Michigan. Business Office: 1757 K St., N.W., Washington 6, D.C. Acceptance for mailing 
at specia rate of postage provided for in the Act of February 28, 1925, embodied i paragraph 4, section 588, 
P. L. & R., authorized March 25, 1936. Second class postage paid at Menasha, Wisc 


nsin. 
Anyone wishing to change - mailing address should — eight weeks notice. A pee of the address taken from 
an issue of the periodical shoul: pany the nge of address request. 


Elementary Statistics* 


By PAUL G. HOEL, University of California, Los Angeles. Designed 
for those with only a limited background in mathematics, this book intro- 
duces the fundamental ideas of statistical methods in a simple, systematic 
manner. Complete course flexibility is possible since the author has included 
both elementary statistical topics and four special chapters of a somewhat 
more advanced nature. These chapters deal with the chi-square distribution, 
time series and index numbers, nonparametric tests, and analysis of variance. 
Time series and index numbers will appeal to the economist, while the 
social scientist will find interest in the nonparametric tests. 

1960. 261 pages. Illus. $5.50. 


Mathematical Methods for Digital Computers 


Edited by ANTHONY RALSTON, Bell Telephone Laboratories; and 
HERBERT S. WILF, University of Illinois. The problems included in this 
book have been drawn from all branches of applied mathematics—and each 
illustrates a particular method of numerical analysis. Each chapter affords 
both a thorough mathematical treatment of a given method and a detailed 
account of its reduction to digital computer terms. The solution of the vari- 
ous problems involves not only the more familiar computer techniques but 
also several new and highly promising procedures and tactics (e.g., the 
Monte Carlo method). Each chapter of the book represents the contribution 
of a man in close contact with the latest developments in his field. 

1960. 293 pages. Illus. $9.00. 


Modern Probability Theory and Its Applications* 


By EMANUEL PARZEN, Stanford University. Aside from demonstrat- 
ing the uses of probabilistic ideas and techniques, the author emphasizes the 
intrinsic value of the basic concepts of probability theory. He shows how 
these concepts may be employed to formulate probability problems in a 
mathematical manner, so that they may be systematically approached by 
routine methods. The author covers both discrete and continuous proba- 
bility. The work contained in the book does not require an advanced mathe- 
matical background beyond calculus. Included are a large number of ex- 
amples; over 120 theoretical exercises; and more than 480 non-theoretical 
exercises that are used to illustrate the variety of situations to which prob- 
ability theory may be applied. 


1960. 464 pages. Illus. $10.75. 


*One of the Wiley Publications in Statistics, 
Walter A. Shewhart and S.S. Wilks, Editors. 


Send for your examination copies. 


JOHN WILEY & SONS, Inc. 
440 Park Avenue South New York 16, N.Y. 


Please mention the Journal of the AMERICAN STATISTICAL ASSOCIATION in writing advertisers 


i 
BOOKS 
: 
: 


JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


VotumeE 55 June 1960 NumBzpe 290 


ARTICLES 


A Multiplicative Model for Songs Variances which are Affected by Several 
Factors .. . . Ropert E. BEcHHOFER 


Certain Statistics . . .  Rosert V. 


Changes in Rate and — of Household Formation . 
SHERMAN J. Maisen 


STEPHEN SPIEGELGLAS 


Optimal of W eighted . « Joun Mutu 


when the Control is Fallible 
Freverrc M. Lorp 


Variance of the Median of a Distribution . R. RIpER 


Tests of the that a Linear stem Obeys Two Separate Re- 
gimes . .  Ricwarp E. QuANDT 


A Note on Tau as a of . L. Hays 


Estimation in the Truncated Poisson Distribution when Zeros and Some Ones are 
Missing. . . A. Ciirrorp CoHEN, JR. 


NOTES ABOUT AUTHORS. 
SUMMARIES OF PAPERS DELIVERED AT 9TH ANNUAL MEETING 
PUBLICATIONS RECEIVED . 
REPORT OF THE BOARD ~ DIRECTORS “AND SECRETARY- TREAS- 
URER FOR 1959. 
BOOK REVIEWS 


LiprittT, VERNON G., Determinants House Furnishings and 
Equipment . . RicHarp STONE 


Menges, GUNTHER, Stichproten aus Gesamtheiten (Sampling from finite 
populations) . . Herman Wo tp 


KeMENY, Joun G., Lacan, AND THOMPSON, GERALD 
L., Finite Mathematical Structures . . . Patrick 


Tuer, H., Economic Forecasts and Policy . . . . . .  V. Lewis BassiE 


MILBANK Funp, of in Human Fertility 
Evetyn M. Krracawa 


U. OF AND OrricE oF EpucatIoN, 
— of State School Systems: 1966-656, Organization, Staff, Pupils, and 
inances 
Current Expenditures Per Pupil in Public School Systems: Small and Medium- 
Sized Cities, 1956-57. Circular No. 638 
Participation in "Adult Education. Circular No. 589 
Statistics of Land-Grant ead and U aaeed Year Ended June 30, 1957. 
Circular No. . Francis G G. Cornet 


SrupEnskI, Paut, The Income of Nations « 
GuMBEL, E. J., Statistics of Extremes. . . . . 


Haaaarp, Ernest A., Intraclass Correlation ont the Sr of Variance . 


(continued on page tv) 


4 

245 
265 
268 
284 
299 

307 
322 
324 
331 

342 
349 
350 
400 
402 
375 
376 
377 
378 
379 
381 
382 
383 
384 


Hoop, Wo. C., Financing of Economic Activity in Canada . E. F. Beacu 


CHAKRAVARTY, S., The Logic of Investment Planning . . B. CHENERY 
Lona, CLARENCE D., The Labor Force under Income and 
M. W. REDER 


URBAN The World’s Metropolitan 
Henry S. Suryock, Jr. 


BE. AND Manpower and Innovation in Ameri- 


. . CLARENCE D. Lone 
Bootru, ANDREW D., Automation and Computing . . . GeorGE E. ForsyTHE 
ARMOUR RESEARCH of the Annual Computer Appli- 
cations Symposium B. E. 
UniTep Nations, Foop anp LTURE Report on the 1950 World 
Census of Agriculture, Volume II, Census Methodology . UD. GALE JOHNSON 


JOHN AND A. L., Handbook of Area Sampling . 
G. Mapow 


S., AND WISHART, ‘Editors, “Student’ s” Collected Papers. 

Vaspa, 8., Readings in Linear Programming . . . Lions 

Fern, Rasut, Economics of Mental Illness. Joint Commission on Mental Illness and 

STATISTICAL OFFICE OF THE UNITED Nations, United Nations Demographic Yearbook 
1968. . . . . . W, Barciay 


Voat, Evan Z. , Ray, Water Witehing, U. A. 


Coun R. 


387 


388 


389 


390 
391 


392 


393 


394 
394 
394 


396 


397 


399 


120th ANNUAL MEETING Meeting jointly with: 


of the ® The Institute of Mathe- 
matical Statistics 

AMERICAN ® Biometric Society, ENAR 
and WNAR 


Stanford University, Stanford, Calif. | ° Western Economic Asso- 
AUGUST 23-26, 1960 © Western Farm Economic 


Association 


STANFORD is located about 30 miles south of San Francisco, one of the country’s 
most interesting cities. San Francisco Bay with its bridge, the Golden Gate, Golden 


Gate Park, the steep hills with their picturesque cable cars, the Mission Dolores, 
Chinatown, the Embarcadero and Fishermen's Wharf, the fine restaurants and 
famous night spots, all make San Francisco a visitor's delight. Across the Bay are 
Berkeley and the University of California. Just north of the Golden Gate are Mt. 
Tamalpais and the Muir Woods with their giant redwoods. About 80 miles south of 
Stanford is the Monterey Peninsula with its famous 17-mile drive, the old city of 
Monterey—the capital of Spanish California, and the charming town of Carmel— 
favorite of writers and artists. Down the coast are many attractions of Los Angeles 


and San Diego. 


The Palo Alto climate is at its best in August. Days are sunny and warm, but 


not humid. Nights are cool. 


Details of registration, room reservations and program will appear in the June 
issue Of THE AMERICAN STATISTICIAN and in forthcoming mailings. Enter the dates 


on your late summer calendar. 


Please mention the Journal of the AMERICAN Statistical AssociaTION in writing advertisers 


+ 
ee: 
He 
|| 
; 


1960 OFFICERS, AMERICAN STATISTICAL ASSOCIATION 


Board of Directors 


President President-Elect Past President 
Morris H. Hansen MartTIN R. GAINSBRUGH Rensis LIKERT 


Vice-Presidents 


(1958-60) (1959-61) (1960-2) 
Howarp L. JoNnEs Guy Orcutt Georae E. P. Box 


Directors 


E. Bureess (1958-60) Raymonpd T. BowMAn (1959-1) 
Oscar KEMPTHORNE (1958-60) Harowp F. Dorn (1960-2) 
A. H. Bowker (1959-61) FREDERICK MostTELLER 


Secretary-Treasurer & Executive Director 


Dona.p C. RILEY 


Members of the Council 


CHARLES M. ARMSTRONG E. J. Enequist, Jr. James W. KNowLes 

GEOFFREY BEALL Wa .terR T. FEDERER GERALD J. LIEBERMAN 

Hvuas H. Brown FERBER Artuour S. 

J. PARKER Bursk Spencer M. Fres, Jr. JoHN THORNTON MARSHALL 

Matcoum B. CatLin Ricuarp A. FreuND Pavut MEIER 

Epwin L. Crow Maurice I. GERSHENSON Louis J. PARADISO 

CuTHBERT DANIEL: Joun I. GRIFFIN ALMARIN PHILLIPS 

Hersert A. Davip Currrorp G. HILpRETH R. Rimmer 

Hersert T. Davip DaniEL G. Horvitz Rosert B. Spears 

Paut M. DENSEN J. Sruart HunTER FREDERICK F. STEPHAN 

Davin B. Duncan Davip V. HUNTSBERGER WHITE 

Artuur M. DutTron R. E. JoHnson Martin B. WILK 
NaTHAN KEyYFITz 


Section Chairmen for 1960 


Meter, Biometric Section 

James W. KNow .gs, Business and Economic Statistics Section 
Paut E. Riper, Section on Physical and Engineering Sciences 
Freperick F. Stepuan, Social Statistics Section 

Rosert FersBeER, Section on Training 


Tables and other aids to computation appearing in this Journal are abstracted and 
indexed in Mathematical Tables and Other Aids to Computation. 


= 


EDITORIAL COLLABORATORS 


HELEN AsseEy, Johns Hopkins University 

T. W. AnpERsON, Columbia University 

FrANK J. ANSCOMBE, Princeton University 

Cart Artvt, Federal Reserve Bank of St. 
Louis 

Jacos BEARMAN, University of Minnesota 

AGNES BERGER, Columbia U niversity 

JOSEPH BERKSON, Mayo Clinic 

Sruart A. Bessier, Stanford University 

MARTIN BRONFENBRENNER, University of 
Minnesota 

Byron Brown, 
Minnesota 

H. D. Brunk, University of Missouri 

CaGcan, Brown University 

W ¥; LIAM G.Coc HRAN, Harvard University 

CouEN, Jr., U niversity of Georg 

W. ‘S. Connor, National Bureau of "Stand- 
ards 

Jean Crockxert, Wharton School of Finance 

Epwin Crow, National Bureau of Standards 

Wa ter L. Deemer, Department of the Air 
Force 

W. Epwarps Demina, New York University 

Lucttte Derrick, University of Illinois 

Acueson J. Duncan, Johns Hopkins Uni- 
versily 

BEVERLY Duncan, University of Chicago 

Duptey Duncan, University of 
Chicago 

Lita Exvvesack, Tulane University 

BENJAMIN Epstein, Stanford University 

Watrer T. Feperer, Cornell University 

MARIANNE FERBER, University of Illinois 

Joun Frrestone, City College of New York 

SupuisH GuuryE, Northwestern University 

Joun P. Gitpert, Center for Advanced 
Study in the Behavioral Sciences 

S. Cotum GILFILLAN, Chicago, Illinois 

pang C. Guick, Bureau of the Census 
Leo A. GoopMAN, University of Chicago 

Ku Stanford University 

Z. E. GREVILLE, Department of Health, 
and W elfare 

F. E. Gruss, Ballistics Research Labora- 
tories 

Joun Gurtanp, Towa State College 

MARGARET GURNEY, Bureau of the Census 

Ropert L. Gustarson, Michigan State 
University 

M. HAurerin, General Electric Co. 

FRANK A. Hanna, Duke University 

H. O. Hartey, Jowa State College 

Putri M. Hauser, University of Chicago 

Wassity Horrrpine, University of North 
Carolina 

Austin C. Hoaaart, University of Cali- 
fornia, Berkeley 

VERNON Jouns, Stanford University 

V. Jones, University of North Carolina 

J. H. B. Kemperman, Purdue University 

Oscar KEMPTHORNE, "Iowa State College 

Braprorp F. KIMBALL, Public Service 
Commission 

Kisu, University of Michigan 

GrorGE Kierner, Universily of Illinois 

WituraM H. KRUSKAL, University of Chi- 
cago 

Roy KvEBLER, JrR., University of North 
Carolina 

Jack LADERMAN, Office of Naval Research 

STANLEY LEBERGOTT, fice of Statistical 
Standards 

GERALD LIEBERMAN, Stanford University 


University of 


GarRIE LosEE, Bureau of the Census 

Eur Marks, National Analysts, Inc. 

ALBERT W. MarsHatu, Stanford University 

FraNK Massey, University of California, 
Los Angeles 

Tep K. Marrues, Columbia University 

F. Scorr Maynss, University of Minnesota 

Paut Meter, University of Chicago 

James MerGs, Federal Reserve Bank of St. 
Louis 

Paut Meyer, Washington State University 

SaMuEL Messick, ducational Testing 
Service 

Rupert G. Jr., Stanford Univer- 
sity 

W. Federal Reserve 
Bank of Chicago 

Lincotn E. Moses, Stanford University 

R. B. Murpuy, Bell Telephone Laboratories 

Marc NeEr.ove, University of Minnesota 

Haro.wp NIssE.son, Bureau of the Census 

Artuur M. Oxun, Yale University 

Ricuarp H. OstHermer, Life 

D. B. OwEN, Sandia Corporation 

EMANUEL ParzEN, Stanford University 

JouN Pratt, Harvard University 

FRANK ProscuHan, Electronic Defense Lab- 
oratory 

Ronap Pyke, Columbia University 

Rerrer, Purdue University 

— R. River, Wright Patterson Air Force 

A. KimsBatut Romney, Stanford University 

Joan R. Rosensuatr, National Bureau of 
Standards 

ABRAHAM RotuMaN, Bureau of Labor Sta- 
tistics 

Haroup RuBEN, Columbia University 

GERALD Sacus, Columbia University 

I. Ricuarp SavaGe, University of Minne- 
sota 

Henry D. SHevpon, Bureau of the Census 

Suiskin, Bureau of the Census 
S. SHRIKHANDE, University of North 
Carolina 

Monroe G. SrrkeEN, National Office of 
Vital Statistics 

Morris SKriBinsky, Purdue University 

Rosert M, Soiow, Massachusetts Institute 
of Technology 

R. Sprow University of California, 
Los Angeles 

GrorceE J. Sticuer, University of Chicago 

DaniE_ B. Suits, University of Michigan 

F. Taytor, University of Cali- 
fornia, Berkeley 

LazarE TEPER, [nternational Ladies’ Gar- 
ment Workers’ Union 

F, E. Terman, Stanford University 

Haroup A. Tuomas, Jr., Harvard Uni- 
versity 

GERHARD TINTNER, Jowa State College 

Vickery, Columbia University 

D. F. Voraw, Jr., Yale University 

Davin L. University of Chicago 

Harry WEINGARTEN, Department of the 

LioneL Wertss, Cornell University 

Joun 8S. Wuite, Minneapolis-Honeywell 
Regulator Co. 

W. Max Woops, Stanford University 

Victor ZARNowIt1z, University of Chicago 

Marvin ZELEN, National Bureau of Stand- 
ards 


al 


JOURNAL OF THE AMERICAN 
STATISTICAL ASSOCIATION 


Number 290 JUNE 1960 Volume 55 


A MULTIPLICATIVE MODEL FOR ANALYZING VARIANCES 
WHICH ARE AFFECTED BY SEVERAL FACTORS* 


Rosert E. BEcHHOFER 
Cornell University 


. Introduction 
. The two-facter experiment (univariate response) 
3. Experimental interpretations of the models 
3.1 Factorial experiments 
3.2 Experimental designs (randomized complete blocks, etc.)................. 
3.3 Factorial experiments used with particular experimental designs........... 
3.4 Response surfaces 
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4.1 Point estimation 
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4.3 Hypothesis testing 
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4.3.2 The univariate general multiplicative hypothesis 
4.4 Multiple-decision problems 
. The two-factor experiment (multivariate response) 
5.1 Some particular multivariate 
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. Acknowledgment 
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A multiplicative model is proposed for analyzing multifactor experi- 
ments which are conducted to study the effect of changes in the levels 
of the factors on the variance of a chance variable. The model is a 
direct analogue of the additive model for means. Univariate and multi- 
variate applications to factorial experiments, experimental designs, 
and multiplicative response surfaces for variances are discussed. 


1. INTRODUCTION 


RACTICALLY all of the early and fundamental research in the theory and 
Prrslysis of experimental designs concerned itself with experiments which 
were conducted to estimate the mean response of a chance variable. This was 
probably so because the experimenters who prompted the development of such 
designs were the agriculturists, and in agriculture one of the most useful ways 


* This research was supported in part by the Army, Navy and Air Force under Contract N6onr-25126 (NR 
042-002) with the Office of Naval Research and in part by the U. S. Air Force through the Air Force Office of 
Scientific Research of the Air Research and Development Command under Contract AF 49(638)-230 and was 
written while the author was on sabbatical leave at Stanford University from the Sibley School of Mechanical 
Engineering, Cornell University. 
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of characterizing the “goodness” of the varieties under study is in terms of their 
mean yields. 

In recent years engineers have come to appreciate the role that statistics 
plays in the design, analysis, and interpretation of their experiments. Since 
the engineers too are often interested in estimating the mean response of a 
chance variable, for some of their problems they were able to take over bodily 
(or after suitable modifications) most of the theory and analysis of experi- 
mental designs which had been developed for the agriculturists. 

However, in many of their problems the engineers, unlike the agriculturists 
are mainly interested in estimating the variance of response of a chance vari- 
able. This is particularly true, for example, with military weapons where the 
target dispersion of the weapon can be measured in terms of the variance; with 
measuring instruments (such as scales) where the precision of the instrument 
can be measured in terms of the variance; and with production processes where 
the determination of whether or not a measured characteristic can be held 
within specification limits will depend on the variance of that characteristic. 
Unfortunately, there has not yet been developed any general theory of experi- 
mental designs which is applicable to experiments which are conducted to esti- 
mate the variance of response of a chance variable. Procedures for the statistical 
analysis of such experiments are in a far more rudimentary state, even for the 
important special case in which the responses are normally distributed. 

In this paper we propose a new! way of looking at multifactor experiments 
which are conducted to study the effect of changes in the levels of the factors 
on the variance of a chance variable. A mathematical model is proposed which 
assumes that the effect of changing a factor from “level” 1 to “level” 2 is to 
multiply the original variance of the chance variable by a positive constant; 
this is in contrast to a model which assumes (say) that the effect of changing 
a factor from “level” 1 to “level” 2 is to add a constant to the original variance 
of the chance variable. Thus a multiplicative model allows one to speak of 
relative (percentage) changes; an additive model allows one to speak of absolute 
changes. 

It would appear that the multiplicative model for variances will be meaning- 
ful in many diverse experimental situations—for example in experiments in- 
volving the target dispersion of a weapon, the life of an electron tube, or the 
radioactivity of an element where the associated chance variables might be 
assumed to be normally, exponentially, or Poisson distributed, respectively. 
Of course, the apprepriate statistical analysis for a particular experiment will 
depend on the distribution of the underlying chance variable. 

As we define it, the multiplicative model for variances can be viewed as a 
direct analogue of the additive model for means, and all of the terminology and 
concepts carry over immediately. However, although experimental designs 
play essentially the same role in experiments which are conducted to study 
variances as they do in experiments which are conducted to study means, the 


1 When this paper was in an advanced draft stage, two papers [23], [24] (at the time unpublished) by Zelen came 
to the attention of the writer. Zelen proposed one of our univariate multiplicative models, namely (7), with special 
reference to the exponential distribution and life-test problems. There are other points of overlap between our papers. 
However, the point of view adopted in our paper is quite different from Zelen's, and our approach is more general. 
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statistical analysis of the results obtained in the two types of experiments will 
be markedly different. Even in the case in which the chance variables can be 
assumed to be normally distributed, exact statistical procedures for analyzing 
multifactor experiments involving variances have scarcely been considered 
while the corresponding procedures for analyzing experiments involving means 
are very highly developed. 

The paper is organized along the following lines: The univariate multiplica- 
tive model for variances is introduced in Section 2. Its experimental interpreta- 
tion in classical terms is given in Section 3. Factorial experiments, experimental 
designs, and the concept of an approximating multiplicative response surface 
for variances are also described in this section. Some of the unsolved statistical 
problems which arise from the model are pointed up in Section 4. It is shown 
how the use of the log transformation of the sample variances, when the under- 
lying variables are normally distributed, makes it possible to employ standard 
procedures for analyzing the data and thereby obtain excellent approximate 
results. Consistent point estimates of the parameters of interest are proposed. 
It is shown how a large class of hypothesis-testing problems can be viewed as 
a special case of what is termed the univariate general multiplicative hypothesis 
for variances—an analogue of the univariate general linear hypothesis for 
means. In Section 5 the multiplicative model for variances is generalized to 
the multivariate case, some additional unsolved statistical problems are indi- 
cated, and a practical problem for which the multivariate multiplicative model 
might be appropriate is described. 

In the various sections most of the ideas are illustrated using the two-way 
classification, i.e., using two-factor experiments. It will be clear how these 
ideas generalize to multifactor experiments. 


2, THE TWO-FACTOR EXPERIMENT (UNIVARIATE RESPONSE) 


Suppose that one is considering reN independent observations Xj 
2,---, Ni;) ina two-way layout. For 
simplicity of notation the special case N,;=N (for all 7, 7) will be treated al- 
though the ideas set forth below hold for the general case in which the N,; can 
vary from cell to cell. It is asswmed that the population mean and the popula- 
tion variance of X,;, depend only on the particular cell in which the observa- 
tion is located, i.e., that 


E{ Xin} = @ =1,2,---+, 737 = 1,2, - 


(Xin — wis) } = 


where ¢;;?>0. One can define 


= ( Il ci) 


i=1 j=1 


Gj I/e r o \lre 
( II vi) /( Il 
j=l t=1 j=l 
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fk 
2 r c 2 l/re 
i=1 j=l 
= l/r lle (2 1, 2, a, “~) c) (6) 
2 2 
(Ici) (Is) 
i=1 j=l 
and write 
2 2 : : 


Here o?>0, the a;>0, 8;>0, and y;;>0 are parameters; because of the way in 
which they are defined, they satisfy the equations 


II a=1 


II v= 1 


t=1 


(8) 
II 4; =1 II = 1. 
j=1 j=l 


We term (7) a univariate multiplicative model for variances. (Another such model 
for the situation in which Factors A and B are quantitative is given in Section 
3.4.) Mathematically it is, of course, always legitimate to write (7) since it 
merely states an algebraic identity. However, in any particular practical prob- 
lem the legitimacy of assuming this model must be based on the experimental 
situation under consideration. In Section 3 the parameters a;, 8; and ¥;; are 
given an experimental interpretation. 

In some problems the form of the underlying distributions of the X ;;. is such 
that it can be written f(x; 61, 42) =f((x—61)/@2), i.e., the distribution can be 
characterized by a pure location parameter and a pure scale parameter. This 
would be the case, for example, with the normal distribution or with the rec- 
tangular distribution 


1/6. for S (x — +3 


0 otherwise. 


fla; 01, 02) = (9) 


For such distributions it often is meaningful to consider simultaneously the 
multiplicative model (7) for variances and the following univariate additive 
model for means: 


(10) 
Here u, the af, 8% and y% are parameters defined by 


i=1 j=l 


- 

| 
2 
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i (11) 

w= mis/re 11 


MULTIPLICATIVE ANALYSIS OF VARIANCE 


i=1 j=l 


B; = ( G =1,2,---,¢) 


i=1 i=1 j=l 
Vii = Big — ( ws/e) ( + ( > 
j=l t=1 i=1 j=l 
-,r;j7=1, 2, €). 
The parameters satisfy the equations 


t=1 
8; = 0 = 0. 
j=l 


As with the multiplicative model (7) for variances, the legitimacy of assuming 
the additive model (10) for means must be based on the experimental situation 
under consideration. Let 


Xy= DL Xia/N (16) 
k=1 


and 


= (Xm — - 1)@=1, =1,2,-+-+,¢) (17) 


denote the usual unbiased estimates of yu; and of. The statistics (16) and (17) 
(which are based on N21 and N22 observations, respectively) provide un- 
biased estimates of their associated parameters without regard to the form of 
the underlying distribution of the X;;,. Of course, if the form of the distribution 
is known, then these estimates can be replaced by the appropriate unbiased 
estimates. (For example, if the underlying distribution of X is (9), then (16) 
can be replaced by (U+V)/2 which is an unbiased estimate of uz, and (17) can 
be replaced {(N+2)(N+1)(V—U)?}/{12N(N—1)} which is an unbiased esti- 
mate of where U=min (Xi, Xv) and V=max (X,, ---, Xy).) 

The additive model (10) for means is well known and widely used. In fact, it 
is the usual fixed effects model underlying most of the analyses of experiments 
involving means which are affected by several factors. (See, for example, 
Cochran and Cox [7], Kempthorne [16], or Scheffé [19].) However, the multi- 
plicative model (7) for variances appears not to have been proposed hereto- 
fore,? although it seems to the author to be a very reasonable one for analyzing 
experiments involving variances which are affected by several factors. 


2 See footnote 1. 


(13) 
(14) 


250 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


3. EXPERIMENTAL INTERPRETATIONS OF THE MODELS 


3.1 Factorial experiments. The Xj;’s of Section 2 can be viewed as the out- 
come of a two-factor experiment which was conducted to study the “effect” 
of these factors on the variance of the chance variable, on the mean of the 
chance variable, or (for certain problems) on both the variance and the mean 
of the chance variable, simultaneously. The first factor, Factor A, was studied 
at r “levels” while the second factor, Factor B, was studied at c “levels”; the 
experiment was a complete factorial, all re treatment combinations having 
been used. 

If the univariate multiplicative model (7) for variances is appropriate for the 
experiment, then the a;, 8;, and y;; can be given the following interpretations: 


a; is the main “effect” of the 7th level of Factor A 
8; is the main “effect” of the jth level of Factor B (18) 
i; is the first-order interaction “effect” of the ith level of Factor A 

and the jth level of Factor B. 


In (18) all “effects” are on the variance of the chance variables. If the univariate 
additive model (10) for means is appropriate for the experiment, then the af, 67, 
and 7%, can be given similar interpretations, all “effects” being on the mean of 
the chance variables. 

The interpretations given in (18) are, of course, the natural analogues of the 
conventional interpretations given to the “effects” in the additive model (10). 
As might be anticipated, for each of the conventional concepts associated with 
the additive model (10) there is a natural analogue associated with the multi- 
plicative model (7). This correspondence should be borne in mind throughout 
the remainder of this paper, and it will contribute materially to an under- 
standing of the multiplicative model. In what follows we shall concern ourselves 
almost exclusively with the multiplicative model for variances, and shall assume 
that the reader is familiar with the additive model for means. 

3.2 Experimental designs (randomized complete blocks, etc.). The X ;;’s of Sec- 
tion 2 also can be viewed as the outcome of a two-factor experiment which was 
conducted to study the “effect” of one factor on the variance of the chance 
variable, at the same time eliminating the “effect” of the other factor on the 
variance of the chance variable. Thus, one might be considering a randomized 
complete blocks experiment in which Factor A, the factor under consideration, 
was studied at r “levels” while Factor B, the factor the “effect” of which is to 
be eliminated, was studied at c “levels,” i.e., there are c blocks each containing 
r “plots.” Then ¥;; is the interaction “effect” of the 7th “treatment” and the 
jth “plot” on the population variance; in many experiments it would be appro- 
priate to assume that y,;=1 ({=1, 0c), 1.e., that there 
is no block-treatment interaction “effect.” The randomized blocks model (and 
related models) would appear to be particularly appropriate for many problems 
in the physical and engineering sciences since it is often quite reasonable in 
these problems to assume that the block is a homogeneous unit. 

It is immediately obvious how to generalize the multiplicative model (7) for 
variances so that experiments involving more than two factors can be analyzed. 


Fe 
; 
~ 
= 
4 
Pe 
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The method of setting up experiments involving balanced incomplete blocks 
is also clear, as is the method of generalizing the multiplicative model (7) so 
that experimental designs such as Latin squares, cross-overs, etc. can be 
employed. 

3.3 Factorial experiments used with particular experimental designs. The uni- 
variate multiplicative model (7) can easily be generalized so that it will be ap- 
propriate for a multifactor experiment used in conjunction with a particular 
experimental design. Thus, for example, a 23 complete factorial experiment 
which is conducted to study the “effects” of the two factors on the variances 
of the chance variables can be embedded in a 6X6 Latin square which is used 
to eliminate the “effects” of two additional factors on the variances of the 
chance variables. 

3.4 Response surfaces. The multiplicative model (7) for variances is appro- 
priate when the factors under study are qualitative, guaniitative, or both. How- 
ever, when all of the factors which are being studied are quantitative, a more 
meaningful analysis will probably be obtained with the following approxi- 
mating multiplicative model: 


B} — } = I II 
(a€@,b€B;k =1,2,---,N) 


where 


= @E€@bE@;k =1,2,---,N) (20) 


and d,,;>0 (¢=0, 1, -- +, ma; j=0, 1, For distributions involving a 
pure scale and a pure location parameter it may be meaningful to consider 
simultaneously the following approximating additive model: 


i=0 j=0 


In the above, a and b which are the amounts of the quantitative Factors A 
and B, respectively, can take on values in the regions @ and @. The ’s and 
the \*’s are parameters. The r.h.s. of (19) is an approximating multiplicative 
response surface of degree m, in a and m, in b, while the r.h.s. of (21) is an ap- 
proximating additive response surface of degrees nq in a and n, in b. It is straight- 
forword to generalize both (19) and (21) to multifactor experiments. And, of 
course, both (19) and (21) can be used in conjunction with the various experi- 
mental designs. 

It might be remarked that if the experimenter wishes to make statistical in- 
ferences about the parameters of the models (19) and (21) he is not restricted 
to taking his observations Xa.;x (k=1, 2, - - - , N) in a rectangular grid in the 
@X@ region. However, certain patterns of observation-taking will lead to 
more efficient designs and/or to computational advantages. If the a’s and b’s 
are equally spaced in a rectangular grid, then for the multiplicative model (19) 
it is possible to define Ist, 2nd, . . . degree “effects” of Factors A and B and 
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the associated interaction “effects” by analogy with the definitions for the addi- 
tive model (21) [10, pp. 344-52]. 


4. STATISTICAL PROBLEMS ARISING FROM THE MODELS 


The multiplicative models (7) and (19) lead to many new, interesting, and 
for the most part unsolved statistical problems. Some of these problems are 
raised in this section. However, before considering these problems it will be 
helpful to note an interesting and useful relationship between the multiplica- 
tive models (7) and (19) for variances and the additive models (10) and (21) 
for means. The relationship between models (7) and (10) will be discussed first. 

Consider the chance variables 


2 
Yi; = log. $i; =1,2,---,¢) (22) 


where 52, is defined in (17). Now it is known [1] that if the Xj, are normally 
and independently distributed according to (7), then 


1 id 
5a) + O(n’) (23) 


1 
BL Ys} = log — (—+ 
n  3n? 


and 
2 
Var{ = — O(n) = —— + On) (24) 


where n = N —1. It also can be shown that the distributions of the Y,; approach 
normality as n increases. From (7) it is seen that 


log on = log e + log a; + log 8B; + log yi). (25) 
Hence, under the assumption of normality of the X;; one obtains 
1 
E{ Yu} ~ log o? + log a; + log 8; + log yi; — — (26) 
2 : 
r ij ~~ = 
n-1 N-2 (27) 


and the multiplicative model (7) for variances has been transformed approxi- 
mately (the approximation improving rapidly with increasing N) to the addi- 
tive model (10) for means (which now have a constant variance) ; log o?, log a, 
log 8;, and log yi; play the role of u, af, 87, and yj, respectively, and satisfy 


the equations (15). 
Similarly, for model (19) if one assumes the normality of the X,»,, and makes 


the transformation 


(@€@,bE€@) (28) 


(where $2, is defined in the obvious way), then 


ag 

} 
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i 
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ma mb 


E{ Yas} ~ adélog rj, — (29) 


imO jm 


(30) 


Var{ N 


and again the multiplicative model (19) for variances has been transformed 
approximately to the form of the additive model (21) for means with log \,; 
playing the role of dj. 

Under the assumption that the multiplicative model (7) for variances is ap- 
propriate and that the X ,; are normally and independently distributed, it turns 
out that many of the statistical problems that are raised below concerning 
variances can be solved approximately by using the log transformation (22) 
and then employing the Model I [13] theory which has already been developed 
for the additive model (10) for means. In general these approximations can be 
expected to be very good, not only because the Y,; are approximately normally 
distributed and the additive model holds approximately, but also because many 
of the procedures involve the use of sums of the Y,;, and these sums have dis- 
tributions which are even closer to the normal than those of the individual Y,;. 
The fact that the variances of the transformed variables (22) are known (ap- 
proximately) will often be helpful in the design and analysis of the experiment. 
(The variances of the X;; (16) are usually not known for the additive model 
(10).) For example, it is possible to test for the existence of interactions. Clearly 
all of the foregoing also holds for the multiplicative model (19). 

It is important to emphasize that it is only a happy coincidence that when 
the underlying distributions of the chance variables are normal, the log trans- 
formation accomplishes two unrelated objectives. It not only transforms 
(approximately) the multiplicative model for variances to the additive model 
for means, but it also yields a new set of chance variables (the log $*) which 
have (approximately) a constant variance. (And incidentally, the distributions 
of the new variables are closer to the normal than those of the original set 
(the s*).) For it was this variance stabilizing property that originally led to the 
use of the log transformation in the Analysis of Variance. (See, for example, 
Bartlett [2] and Curtiss [8].) More recently the log transformation has been 
proposed as a method of minimizing the effect of possible nonnormality (of 
the X’s) on the operating characteristics of the test procedure. (See, e.g., 
Scheffé [19, pp. 83-7].) It can be anticipated that a log transformation will 
achieve results similar to those described above only for certain classes of 
distributions (see, e.g., Zelen [23, p. 286]), but that in general a different ap- 
proach will have to be used. 

One way of checking the “goodness” of any proposed approximation is by 
comparing the results obtained with it to those obtained using exact small- 
sample theory. This is being done for a particular class of problems [4]. Part 
of the mathematical challenge will come from the attempt to develop such 


new theory. 
4.1 Point estimation. For the multiplicative model (7) one may be inter- 
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ested in obtaining point estimates of o?, the aj, B;, and yi; (¢=1, 2,---, 7; 
j=1, 2, ---,c). In this connection it is to be noted that 


(31) 


im1 j=l 


=(ae) (@=1,2,-+-,7) (32) 


j=l 


j=1 


Paralleling (3) to (6), one can define the (geometric mean) estimates 


r 2 l/re 
6? = ( Il TT 


( Il II 


j=l 


43 = lle l/r 1,2 (37) 
(118) 


For the approximating multiplicative model (19) one may be interested in 
obtaining point estimates of the \,; (¢=0, 1, - - , ma; 7=0,1, -- +, m). Away 
of doing so is to use the Method of Least Squares to estimate the \f=log Ajj, 
i.e., one can minimize 


(a,b) j=0 
with respect to the Xj obtaining estimates jf (say), and then use 
hij = (i = 0, = 0,1,---, ms) (39) 


as his estimate of \,;. The estimates (34), (35), (36), and (37) could have been 
obtained using the above procedure. And it is clear that under very general 
conditions this procedure will provide consistent estimates of the various 
parameters. 

If the s?, are replaced by log $2,, then variations of the techniques developed 
by Box, Hunter, and others [10, Chap. 11], can be used to “explore” the trans- 
formed approximating multiplicative response surface (19). And it might be 
anticipated that the results of Kiefer and Wolfowitz [17] and others on the 
efficient spacing of peints in polynomial regression will indicate the best choice 


1, 2, r) (35) 
= 1,2,-++,¢) (36) 
' 
| 
= 
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of the design points (a, b), i.e., the best choice of the levels of Factors A and B in 
order to obtain efficient estimates of the ),;. 

4.2 Confidential regions and multiple comparisons. One may desire to es- 
timate o?, the a;, 8;, and +;;, or the \,;, individually and jointly by means of 
confidence regions. Of course, to do so some fairly strong assumptions must be 
made about the form of the distribution of the X’s (although only very weak 
assumptions would be necessary in Section 4.1 when point estimates were 
being constructed). One may also wish to make exact multiple comparison 
statements about the oj, similar to those which can be made about the y.; using 
procedures devised for normally distributed variatles by Scheffé and Tukey 
[19, sections 3.5 and 3.6], and Dunnett [12]. A step in this direction for one- 
factor experiments has been made by David [9]. 

4.3 Hypothesis testing. 

4.3.1 Some particular univariate hypotheses. In this section some of the 
simpler forms of a univariate multiplicative model for variances are considered, 
and typical hypotheses that might be tested are indicated. In Section 4.3.2 it is 
then shown how these various hypotheses can be regarded as special cases of a 
univariate general multiplicative hypothesis. In all of this the emphasis will be 
on the multiplicative model (7). 


One-Factor Experiment (Univariate Response) at r Levels 


Suppose that one is considering a one-factor experiment (univariate response) 
with 


(= 1,2,---, =1,2,°-- (41) 


where o,;?>0. In (41) the parameters o? and a; are defined by 


r 
ay = G5 II «: 


t=1 


Clearly the a; satisfy the equation 


II (44) 


i=1 


For r=3 one might be interested in testing hypotheses of the following types: 
Hy: a = a2 = as (45) 


= 91 (46) 
= 92, = 93 (47) 


where 41, g2, gs are specified (unitless) positive constants. 


r 2 l/r 
( II (42) 
1,2,---,7). (43) 
= 
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For the situation in which the X are normally distributed, H, is usually 
tested using Bartlett’s test [1], but other tests have been proposed by Cochran 
[6], Eisenhart and Solomon [14], Doornbos [11], and Hartley [15]; if the Xa 
are exponentially distributed, H, can be tested using Sukhatme’s test [21]. 
Once the form of the underlying distribution of the chance variables is specified, 
the particular test to be used depends on the alternatives against which the 
experimenter wishes to protect himself; the additional criterion of computa- 
tional simplicity is sometimes also introduced. It is interesting to note that 
the exact distribution of the statistic on which Bartlett’s test is based is not 
known in closed form for r>2 although approximate bounds on the upper 
percentage points have been tabulated [18, Table 32]. For r=2 Bartlett’s test 
reduces to the F-test (which also can be used to test hypotheses such as H.2). 
The exact distribution of the statistic on which Sukhatme’s test is based is 
not known for r>3. 


Two-Factor Complete Factorial Experiment (Univariate Response) No Inter- 
action, Factor A at r Levels and Factor B at c Levels 


Suppose that one is considering a two-factor complete factorial experiment 
(univariate response) the model for which is given by (7). It is assumed that 


Ay =1 0) (48) 


and hence one can write 


ois (i 1, 2, r3J = 1,2, C). (49) 


Equation (49) states that Factors A and B do not interact in their “effect” on 
the variance. In this model the a; may represent the treatment “effects” on 
the variance, and the 8; may represent the block “effects” on the variance. 
One may be interested in testing hypotheses such as H,, H2, or H3. Or he may 
be interested in testing a hypothesis such as 
Hg: = ag = = a, Bi = Bo = Be (50) 
4.3.2 The univariate general multiplicative hypothesis. It will now be shown 
that hypotheses such as H,, He, H3, and H, can be regarded as special cases of 
a univariate general multiplicative hypothesis. The method for two-factor experi- 
ments will be illustrated, and it will be seen that it generalizes immediately to 
an arbitrary number of factors. 
One has 
E{ Xin} = 
-++,N) 
2 2 
E{ (Xin — wis) } = 045 
1, 2, 1, 2, -,¢;k 1, 2, -,N) 


It is assumed a priori that the re variances oj, (52) obey the P 


(51) 
(52) 


ASSUMPTION EQUATIONS 
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where { tiip} is a given PX (rc) matrix of real numbers 


tier, biep 


\too1, 
toer, 


tris, 


tree, 


and V’= (vj, v2, +: +, vp) isa given P-vector of real numbers. Under these as- 
sumptions, one would like to test the hypothesis that the rc variances o4, (52) 
obey the Q 


HYPOTHESIS EQUATIONS 
@=1,2,---,Q) (55) 
im] jel 

where U= {uj;,} is a given QX(rc) matrix of real numbers 


Uie2, °° * Ui2Q 


Utely 


U102; 


U211, U2i2, * » U2i1g 


Ue2e1, * * » U22Q 


Ure1, U2e2, * * Ureg 


Uri1l, Uri2, 


Ura1, Ure2, * * » Ureg 


and W’=(w, we, - - + , We) is a given Q-vector of real numbers. The elements 
of the T- and U-matrices must satisfy the equations 


tooo, , loop 
=. (54) ’ ! 
trip 
U=4- (56) 
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tip =O (p 

(57) 

= O (q 

j=l 
The equations (57) imply (see (58) and (59) that only contrasts among the log 
oi, will be considered. (In particular, this means that one cannot test hypotheses 
such as Ho:a,=go where go is a specified positive constant in the units of the 
problem.) 

The assumption and hypothesis equations, (53) and (55), can be written 

equivalently as 


r c 


DX Dd tiip log = log Up (p (58) 


i=1 j=l 


r ec 


Dd log ois = log we (q=1,2,---,Q), (59) 


i=l j=l 


respectively. It is assumed that the equations (58) are linearly independent of 
the equations (59) and that the rank of T plus the rank of U is less than or 
equal to rc. In the form (58), (59) it is seen that one is testing a general linear 
hypothesis concerning the logarithms of the variances, under the assumption 
that these logarithms satisfy certain linear relationships. 

The fact that the representation (53), (55) contains the problems of Section 
4.3.1 as special cases will now be demonstrated for two-factor experiments. 
Suppose that one considers the multiplicative model (49). The assumption 
equations (53) follow from (48), and for r=3, c=2 one has 


{ 2 log ou — 2log ox: — log Pd + log Oss — log on — log iin = Q (60) 


—log + log + 2 log — 2 log — log + log = 0. 


For r=3, c=2 the hypothesis equations (55) associated with hypotheses H,, 
H», H;, and H, can be shown, using (4) and (5), to be 


log o1:-+log log n+ log o22—2 log log 


: 61 


Hz: logen— logoz =2logg: (62) 


(63) 


{ 2 log out? log log log log log 2 log ge 
log out log log log 2 log log oe =2 log gs 


log outlog log ont log 2 log —2 log =0 
H,: {log outlog log on~3 log log ont log = 
log out log log on? log log 
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It is to be noted that (60) to (64) are contrasts among the log oj. The rank of 
the T-matrix is 2, and the rank of the U-matrix for H,, H2, H3, and H, is 2, 1, 2, 
and 3, respectively. For each hypothesis, the hypothesis equations are linearly 
independent of the assumption equations (60), and the rank of T plus the rank 
of U is less than re=6 in each case. 

It should be noted that most of the standard tests of hypotheses concerning 


the e.g., ((=3, 4, -- +, ma; J=3, 4, +, me) where ma, m23, 
can also be regarded as special cases of a univariate general multiplicative 
hypothesis. 


No exact small-sample theory exists at present for testing the univariate 
general multiplicative hypothesis (55), even for the particularly interesting 
case in which the X’s are normally distributed. Moreover, it can be anticipated 
based on related classical studies of one-factor experiments that exact results 
for the normal case will in general be extremely difficult to obtain. However, 
until such theory is developed, it should be possible to obtain excellent approxi- 
mate results using the log transformation (22) and the univariate general linear 
hypothesis theory for means. 

4.4 Multiple-decision problems. Many interesting and useful multiple- 
decision problems can be formulated using the univariate multiplicative model 
(7) for variances. 

For example, the ranking approach which was used by Bechhofer [3, Sec- 
tions 4 and 5] for two (or multi) factor experiments involving population 
means can be generalized to two (or multi) factor experiments involving popu- 
lation variances. Thus, in a two-factor experiment with no interaction between 
factors one can seek that level of Factor A which is associated with the smallest 
“effect” on the population variance and simultaneously that level of Factor B 
which is associated with the smallest “effect” on the population variance; other 
goals of this type can also be proposed. Research along these lines is in progress 
[4] for the situation in which the X ; are normally distributed; it will generalize 
to f factors (f= 2) the results reported for one factor in [5]. 

Also, the ranking approach which was used by Somerville [20] for one-factor 
experiments involving population means can be generalized to multifactor ex- 
periments involving population variances. And the slippage approach which 
was used by Truax [22] for one-factor experiments involving population vari- 
ances can be generalized to multifactor experiments. 


5. THE TWO-FACTOR EXPERIMENT (MULTIVARIATE RESPONSE) 


In this section a multivariate generalization of the univariate multiplicative 
model for variances for the two-factor complete factorial experiment, no inter- 
action, which was discussed in Section 4.3.1 will be considered, i.e., a generaliza- 
tion of (49). Typical hypotheses that might be tested will be indicated. The 
method of generalizing the approach to multifactor experiments (with or 
without interaction between the factors) will be clear. 


Two-Factor Complete Factorial Experiment (Multivariate Response), No 
Interaction, Factor A at r Levels and Factor B at c Levels 


Suppose that: one is considering rc N independent vector-observations 
X = (Xijecy, + + » in a two-way layout. It is assumed that the 


260 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


population mean and the population variance of Xj), and the population 
correlation coefficient of X (8, 8’ =1, 2, +++, p) depend only on 
the particular cell in which the observation is located, i.e., that 


E{Xin} =n (65) 
E\ (Xin — wi) (Xin — = = 
(66) 
where }_,; is a positive-definiie matrix. Here 

is the p-vector of population means, 

= diag ois, vi), 
is the pXp diagonal matrix of population standard deviations, and 


1 * * * 


= 


) 
| 
Pij(21) | | 


(69) 


is the pXp matrix of population correlation coefficients. (Note that the possi- 
bility of a different correlation matrix for each of the re combinations of levels 
of Factor A and Factor B is allowed.) 


As in (3) to (6) one can define the scalars 
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= ( Il Teun) (s=1,2,--- 
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= 1 p). (74) 
One can next define the p Xp diagonal matrices 


D, = diag o@, +++, (75) 


1/2 1/2 1/2 


1/2 1/2 1/2 1/2 
Da, = diag {Bj), Bj, » Gj=1,2,---,¢), (77) 
the positive elements of which are given by (70), (71), and (72). Finally one 
can define the square matrix 


(78) 


which if P,;=P (¢=1, 2, -- -,r;j=1, 2, - ¢) can be written as 
= D.PD.. (79) 


Then it is clear that assuming (74) is equivalent to assuming that one can 
write 


1/2 1/2 


= DeDa; Da; p= 1,2,---,r;37 =1,2,---,c) (80) 


Cis 
and hence that 


1/2 1/2 _1/2 


Dis = Ds, De, Deg 1,2,-- 0) (81) 


where )>,; depends on i and j only through P;; and not through the o;;¢) 
(t=1,2,---,rj;j=1,2,---+,c;s=1,2,---,p). We term (81) a multivariate 


multiplicative model for variances. The matrices D,? and Dj satisfy the equations 


II = TI De, = 1p 
t=1 t=1 

82) 
1/2_1/2 
II Ds; Ds; = I] Do, = 1, 


j=l j=l 


where I, is the p Xp identity matrix. 

If the multivariate multiplicative model (81) for variances is appropriate for 
the experiment, then the aie), Bj), and yij~) can be given the same experi- 
mental interpretations as in Section 3 except that in every case the “effect” is 
on the variance of Xj), the sth component of the vector chance variable 
X jj. Equation (81) states that Factors A and B do not interact in their “effect” 
on the variances of the X jj) (s=1, 2,---, p). In this model the ai.) may 
represent the treatment “effects” on the variance of X jjx;s), and the Bj) may 
represent the block “effects” on the variance of (¢=1, 7; 
j=l, p). 
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In Section 5.2 a practical problem for which the multivariate response model 
(81) appears reasonable is described. For this problem it is assumed that 
the vector-observations X,; have a multivariate normal distribution. 

5.1 Some particular multivariate hypotheses. For the model described above 
one might be interested in testing hypotheses which for r=3, c=2 (say) could 
be expressed as 


(83) 
(84) 


(85) 


(86) 


where D,, D,,, and D,, are diagonal matrices with specified (unitless) positive 
constants on the diagonal; these hypotheses are multivariate analogues of H,, 
H>», H;, and H,. Or one might be interested in testing a hypothesis such as 


: ai) *** = ) (i = r’) (87) 


where (2<p; <p) and (1Sr’ Sr). (One might also wish to test hypotheses con- 
cerning the P;;-matrices (69) or the elements of these matrices.) As might be 
anticipated, it can be shown that hypotheses such as Hj, H}, - - - , Hj can be 
regarded as special cases of a multivariate general multiplicative hypothesis. 

To the best of the author’s knowledge, hypotheses such as Hj, Hj, ---, Hj 
for multifactor experiments have not been treated heretofore, even under the 
assumption that the X;;, have a multivariate normal distribution. 

5.2 A practical multivariate example. Suppose that it is desired to study what 
happens to the range and deflection dispersion of a projectile when the shape 
of the projectile is changed. Three projectile shapes are being considered. It is 
realized that the total number of projectiles (of the three shapes) which are 
to be included in the test will be too great for the firing program to be com- 
pleted in one relatively homogeneous (with respect to level of wind velocity) 
time period, e.g., a day. Moreover, it is also recognized that even within each 
day small cycles (of unpredictable direction and duration) in the level of wind 
velocity may occur. From the viewpoint of eliminating the effect on the vari- 
ance of changes in the level of wind velocity, it is decided to consider each day 
as a block. In order to eliminate these block “effects” on the variance, and to 
minimize the possible biasing effect on the variance of small within-the-day 
cycles in the level of wind velocity, the following experimental design was 
adopted for the firing program: “Within each day 3N projectiles (N each of 
the three shapes) are to be fired. The order of firing is to be in ‘plots’ of three 
projectiles, each projectile shape being represented exactly once in each plot; 
the order of firing the projectiles of the three shapes within each plot is to be 
randomized.” It is suspected that the range and deflection coordinates measured 
from the center of impact are correlated because of the “jump” of the gun, but 


Hi: De, = Dey = Dos 
‘pie 

| 

Hi: Da, Da, = Dy, 
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the value of the correlation coefficient associated with the gun to be used is 
unknown. 

For the problem described above, the multivariate multiplicative model (81) 
with P;;=P (¢=1, 2, ---,7r;j=1, 2, c) would appear to be appropriate. 
Also a bivariate normal distribution might be assumed. Here p=2 since both 
the range and deflection coordinates of the points of impact are being measured, 
r=3 since three projectile shapes are being used, c equals the number of days 
over which the firing program is conducted, and N is the number of projectiles 
of each shape fired in each day. It is assumed that there is no interaction 
“effect” of the projectile shapes and the levels of wind velocity on the variance 
of either the range or deflection coordinates of the points of impact. Thus tne 
experiment is one involving the use of randomized complete blocks; the “effect” 
of the projectile shape on the variances is being studied, while the “effect” of 
the wind velocity on the variances is being eliminated. 
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CERTAIN UNCORRELATED STATISTICS 


Rosert V. Hoae 
University of Iowa 


It is proved that an odd location statistic and an even location-free 
statistic are uncorrelated when sampling from a symmetric distribu- 
tion. It is of interest to note the relative simplicity of the proof of this 
theorem, which is a generalization of two special results of Ostle and 
Steck. 


ECENTLY, in this Journal, Ostle and Steck [1] gave two proofs of the fact 
Ran the sample mean and the sample range are uncorrelated if the random 
sample comes from a symmetric distribution. In addition, they pointed out 
that one of the proofs also implies that the sample mid-range and the sample 
range are uncorrelated. In this paper, these results are generalized by showing 
that every member of a certain class of location statistics and certain location- 
free statistics are uncorrelated provided the random sample is taken from a 
symmetric distribution. The proof of this generalization is seemingly easier 
than either of the two proofs given in the paper cited. Before presenting this 
proof, some definitions and examples are given. 

Let X be either a continuous-type or a discrete-type random variable having 
the probability density function f(z). 

Definition A. The distribution (of probability) is symmetric about the 

point c if 


fle — x) =f(e+2) 


for every real z. 
In the two following definitions it is assumed that X;, Xo, - - X, isa random 
sample from a distribution symmetric about c. 
Definition B. The statistic T(X1, X2, - + - , Xn) is an odd location statistic if, 
for all real x21, - + +, Xn, We have 
1. T(aith, teth, +++, +++, 2n) +h for every h, 
2. T(—21, °° +, —T (a1, In). 
Definition C. The statistic S(X,, X2,---, Xn) is an even location-free 
statistic if, for all real x1, x2, - - , We have 
1. S(aith, reth, =S(ax1, 22, Xn) for every h, 
Some examples of odd location statistics are the sample mean, the sample 
median, and the sample mid-range. However, for example, the largest sample 
item minus some known constant, while a location statistic, is not an odd 
location statistic. The sample variance, the sample range, the sample quasi 
ranges, the sample mean deviation from the sample median, and any ratio of 
two of these statistics are examples of even location-free statistics. 
In the preceding terminology, the theorem can be stated as follows. 
THEOREM. Let X;, Xo, ---, X, be a random sample from either a con- 
tinuous-type or a discrete-type distribution that is symmetric about the point 
c. If the correlation coefficient of an odd location statistic T(X1, Xo, «+ + ,Xn) 
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and an even location-free statistic S(X1, Xe, - - - , Xn) exists, it is equal to zero. 

PROOF. The proof is given for a continuous-type distribution, but the proof 
in the discrete case is essentially achieved by changing the integrals to sums. 

It should first be observed that the existence of the correlation coefficient 
of T and S implies the existence of the means, the variances, and the covariance 
of these two statistics. From the conditions that are imposed upon 7’, it follows 
that F(T) =c. The proof of this fact is given later. However, assuming that it 
is true, we now show that the covariance, and hence the correlation coefficient, 
is zero. 

Let V denote the covariance of T and S. Then 


Under the transformation z;=y;+c¢, t=1, 2, - - , n, having a Jacobian equal 
to one, the covariance is seen to be equal to 


since, by Definition B-1, 
te) —c = Yn) 
and, by Definition C-1, 
++, yn +e) = Sy, Yn). 


By the change of variables y;= —z;, i=1, 2, +++, , expression (1) can be 
written as 


J f(—2n + + den. 


Upon making use of Definitions A, B-2, and C-2, V becomes 


(2) V= -{ f Ter, °° (ar +c) 


A comparison of expressions (1) and (2) yields 
V=-+-V or V=0. 


That the mean of T is c can be proved by showing that 


D = fre. yan) — + + + 


is equal to zero. This is done by using exactly the same transformations and 
argument given above. This, then, completes the proof of the theorm. 
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Finally, it should be noted that the theorem is actually true for every random 
vector (X;, X2,---, Xn») whose distribution of probability is symmetric 
about the point (c, c, - - - , c) in the n-dimensional space. That is, the assump- 
tions that X,, Xs,---+, Xn are stochastically independent and identically 
distributed are unnecessary, as is the restriction to either a continuous-type or 
discrete-type distribution. The author would like to thank a referee for point- 
ing this out and for suggesting certain other improvements in this paper. 
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CHANGES IN THE RATE AND COMPONENTS 
OF HOUSEHOLD FORMATION 


SHERMAN J. MAIsEL 
University of Califernia, Berkeley* 


In the prediction of future household formation, it proves useful to 
separate movements caused by population changes from those caused 
by shifts in the occurrence rates of households among a given popula- 
tion. The varying rates of occurrence reflect the differential impacts of 
economic vairables and social change. This study indicates that until 
better data are available, forecasts of households will have to rely 
primarily upon judgment. This can be gained from an analysis of past 
data plus information carried over from related series which have more 
frequent observations. No single statistical function can be found to 
express the manner in which the Census estimates of households have 
moved. This apparent lack of correlation reflects the need to average 
several years’ data to obtain accuracy. It also stems from relatively 
few past observations, and the likelihood that significant structural 
shifts have occurred and have altered past economic relationships. 


1. INTRODUCTION 


OUSEHOLD formation is a basic variable used in forecasts, projections, and 
analysis of economic growth and consumption. While much of household 
formation can be analyzed as a demographic process, it is sufficiently influenced 


by social and economic forces to change unevenly over time. 

Since most forecasting models of the markets for consumers’ durables, and 
particularly that for housing, employ shifts in the rate of household formation 
as a key factor, predicting future movements of households has become a 
major problem of economists and statisticians. Much of recent mortgage 
market discussion uses an implicit model in which household formation re- 
sponds with greater or less elasticity to shifts,in credit terms. The current 
boom in construction of rental units has been attributed to the shift in type of 
households. Furniture, automobiles, and appliance demands are all related to 
household formation. [1, Ch. 9; 2; 5; 6, Ch. 20; 3, Ch. 8] 

Table 1 shows that during the past decade the Census estimates of the num- 
ber of households formed per year have fluctuated widely. Can we assume 
from this that the demand for houses and the saturation level for consumers’ 
durables have experienced similar sharp year-to-year shifts? If so, what has 
caused them? 

The number of non-husband-wife households grew in this period by about 
four million, or nearly 50 per cent. While highly improbable, it would be 
possible for about 25 million new households of this type to be formed between 
now and 1975. Is it likely that the rate of formation of households from 
these atypical groups will continue to accelerate and thus to increase total 
demand still more rapidly than in recent years? An answer to these questions 


* This paper was prepared with assistance from the Real Estate Research Program, Bureau of Business and 
Economic Research, University of Califurnia, Berkeley. 
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TABLE 1. U. 8. HOUSEHOLDS AND HOUSEHOLD FORMATION, 1947-1959 


Annual Change From Previous Period 


Number in Millions (in Thousands) 


Other 


Husband- 
Wife Male 
Head 


a 


April 1947 
April 1948 
April 1949 
March 1950 
April 195i 
April 1952 
April 1953 
April 1954 
April 1955 
March 1956 
March 1957 
March 1958 
March 1959 | 51. 


362 
44 

- 
214 194 
142 102 
92 422 
848 240 60 180 
668 —185 275 
256 328 275 
453 447 118 329 


cto 


CHO 


Source: U. S. Bureau of the Census: Current Population Reports—Population Characteristics Series P-20, N. 94. 


and an understanding of what is involved in future forecasts of households is 
basic in attempts to prepare long-run plans for many industries. 

The movements reported in the tables result from population growth, from 
economic and social changes, and perhaps from errors or sampling variability 
in the data. An extremely useful first step in forecasts and in explanations 
of past movements is to attempt to separate these different factors. [10, 
Ch. 3; 14] 

If it appears that reported fluctuations reflect primarily short-range statis- 
tical aberrations, then it is important not to develop elaborate explanations 
based on other, less probable grounds. 

If the main source of growth turns out to be demographic, then we can ex- 
pect considerable reliability for our forecasts. For periods up to 20 or 25 
years, forecasts of population influences on household formation are compara- 
tively simple and, at least in recent years, have been among the most accurate 
tools in the forecaster’s kit. 

If, however, we find that the main causes of movements have been rapidly 
shifting occurrence rates of households, then forecasters cannot face the future 
with much confidence. This would indicate that household formation is 
strongly influenced by economic variables or social change. It would show 
households arising as people either changed their tastes or were able to imple- 
ment past desires for more individual and independent living and consuming 
units. Future formation would depend upon how these economic and social 
variables fluctuate. : 

As a preview of what is to come, we can state that for most of the period for 
which we have data, population changes have been by far the main source of 
household growth. Only between the end of World War II and the end of the 
Korean incident were movements of the occurrence rate of major importance. 
Since 1951, the rate has continued to climb, but at a slow and fairly even pace. 
The year-to-year reported movements about this gradual growth trend have 


Other | | 7 
Husband- 
Date Total Wife Male | Female Total 
Head 
5 31.9 1,425 | 1,288 | 137 -299 | 436 
: 2 33.3 1,650 | 1,357 | 293 287 6 
6 34.1 | 1,497 
7 34.4 | 1,017 
5 35.1 | 848 | 
3 35.6 830 
9 35.9 559 
8 36.3 895 
8 37.0 1,088 
5 37.7 758 
4 38.0 859 
3 38.4 900 


270 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


not exceeded the amount one would expect them to vary as a result purely of 
reporting and sampling causes. 

The widely-used Census B projections of household formation to 1975 are 
based on an assumption that the rate of increase in the occurrence rate will 
retard, but that total growth will increase because of increasing population 
factors. Even though no exact relationship between economic prosperity and 
household formation can be demonstrated, judgment based on experience of 
the past decade might well indicate use of a somewhat higher projection of 
atypical households for most full-employment models. 

At first approach, it appears simple to measure past changes in household 
formation and to test theories of how and why fluctuations occur. The Census 
is acknowledged to be one of our better sources of social statistics. One can 
reach into many Census volumes and reports and come up with series that 
appear to be tailored completely to the purpose at hand . [4] It is only after 
a careful analysis of footnotes, changing definitions, and revisions that one 
becomes less certain. The process of obtaining the proper data requires a de- 
tailed search for logical assumptions, constant revisions, and a great deal of 
cross-checking and vigilance. 


2. CONCEPTS AND DEFINITIONS 


Standardization is the statistical technique commonly used by demogra- 
phers in order to hold constant selected variables. [10, Ch. 3; 7; 14] In exam- 
ining data on household formation, one rapidly recognizes the important in- 
fluence and shifting impact of alterations in the size of population. Only 
slightly less obvious is the fact that variations occur through changes in the 
age-sex distribution. In 1959, for example, the number of households headed 
by a person under 20 years of age was extremely small—170,000. In the popu- 
lation group between 20 and 29 years old, only 31 per cent headed households. 
The percentage who were heads of households then rose rapidly, reaching 47 
per cent of those aged 30-44; 53 per cent of those 45-64, and 59 per cent for 
those 65 and over. As the percentage of the population in each of these groups 
varies over time, the expectation of households among a given sized popu- 
lation shifts. 

Standardization attempts to separate the demographic influences (the re- 
sults of the shifts in both the numbers and the age-sex distribution of the 
population) from all other variables. Standardization is not as exact as some 
other statistical techniques because, as with index numbers, the results vary 
somewhat depending upon the base used. 

Occurrence-rates measure the results of movements in socio-economic and 
all other non-demographic forces. The standardized occurrence rates are an 
average of the specific rates of occurrence for each age-sex group in a period, 
utilizing as a base a constant population with a standard age-sex distribution. 

The term households, as used in this paper, conforms to the Census definition 
of a group of people who together occupy a dwelling unit. 

Households are divided into husband-wife households and all others. In a 
husband-wife household, the head is a married man living with his wife. “All 
others” may have either a male or female head, or may be a household com- 
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prised of a single individual, a mother with children, nonrelated individuals, 
etc. 

Doubled couples are married couples who do not head their own households. 
The number of married couples is equal to the number of husband-wife house- 
holds plus the number of doubled couples. 

Marriage is the act of becoming married. The number married at any time 
is the net result of all past marriages minus all marriages dissolved for any 
reason, such as death, divorce, or separation. Because these dissolutions are 
constantly occurring, the annual number of marriages is considerably larger 
than the net increase in the number married. In fact, the net increase has 
ranged between 20 and 50 per cent of the total marriages in a year. The larger 
ratios apply in years when marriages are highest. [1, app. c; 8] 


3. ANNUAL CHANGES 


Since 1947, the Census has published annual data on changes in households 
by type. These are the series which are normally used for short- and medium- 
term forecasting and which form the basis for much econometric work. 

Table 1 indicates sharp year-to-year shifts in household formation and still 
sharper shifts by type. The total number of new households ranges in the pe- 
riod from a low of 559,000 to a high of 1,650,000. Year-to-year changes show 
differences in the rate of growth of 70 per cent or more. The largest changes 
in the household rate occurred before 1952, but recent movements are great 
enough to cause sharp shifts in demand. The table estimates the combined 
results of demographic changes and fluctuating desires to form households. 

Table 2 shows the related changes which took place in the standardized 
household occurrence rates during this same period. All rates are based on the 
age-sex distribution of the population in 1950. The first two columns use as a 
base the total population aged 20 and over. The non-married households are 
based on the age-sex distribution of the nonmarried, and the doubled couples 
utilize the distribution of married men in that same year. 

Because the movements of Table 1 arise from all types of forces, analysis 
is strengthened by standardization and the ability to hold demographic factors 
constant. A comparison of Tables 1 and 2 shows surprisingly large year-to- 
year changes due to population forces. A large part of the annual differences 
in total household formation could arise from this cause alone. 

Comparing the change in household formation between 1951-2 and 1952-3, 
we see comparative stability in the total number of households formed but 
an increase in the household-rate in one year and a fall in the next. The sharp 
drop in the number formed during 1953-4 compared to 1952-8 appears to 
be accounted for almost entirely by population factors. In several years, the 
total units formed and the standardized rates move in opposite directions. 
Clearly, if we are interested in causation and prediction in trying to isolate 
and explain the influence of economic variables, we must examine primarily 
the data on rates rather than those on actual numbers of units formed. 

In analyzing these year-to-year fluctuations, however, and before attempt- 
ing to explain them by a complex theory, we must take into account the fre- 
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quently neglected fact that in addition to economic and demographic causes, 
changes may occur because the data are derived from a sample and because 
they may reflect non-sampling errors and biases. [9] 

The problem of sampling variability is evident when one compares the 
year-to-year changes in Table 1 to the appropriate standard errors furnished 
by the Census. For total households, the standard error for the difference 
between two yearly estimates is 325,000. Given the annual reported changes 
in the number of households, the fact of growth is evident. When we compare 
movements in this growth from year to year, however, we cannot be certain 
that any fluctuations in the rate of change have occurred. The maximum 
year-to-year deviations in the amount of growth since 1951 just about equal 
a single standard error. 

In the smaller groups, fluctuations have exceeded probable sampling vari- 
ances. Thus the standard error applicable to the difference between two esti- 
mates of other male heads is approximately 125,000, while year-to-year devia- 
tions in the number of households formed of this type have run as high as 
515,000. Before we accept even these changes as significant, however, and at- 
tempt to explain them, we should examine the possibility of non-sampling 
errors and biases. 

Unfortunately, we find that deviations of the measured size could probably 
have been caused by these other statistical factors. During the past eight 
years the Census altered its base for computations once and the size of sample 
and its procedures twice. Each change was followed by a particularly violent 
movement in the reported number of male heads. It is also known that errors 
of response in 2 marginal group such as other male heads tend to be great. The 
Census states its belief that the Current Population Reports fail to enumer- 
ate marginal groups as accurately as, for example, the National Housing In- 
ventory. Other problems also arise. At one point the Census reviewed its 
assumptions concerning the allocation of nonrespondents within the group 
of nonmarried males and altered the estimated total by 10 to 20 per cent. 
[9, No. 55} 

Because of the high probability that such biases and errors plus sampling 
variance account for a large share of year-to-year fluctuations, the Census 
warnings against too great reliance on figures for any individual year should 
be more widely heeded. As an example, there is no obvious economic or 
social reason why the rate of male nonmarried heads should have dropped 
sharply in the very prosperous period of 1956-7 and then risen abruptly in 
the depression of 1958. It appears much more likely that these changes reflect 
statistical errors which may well have underestimated male heads by as much 
as 300,000 in 1957. 

These large potential sampling errors make it difficult to analyze quanti- 
tatively with much validity the annual variations in the occurrence rate, 
particularly for types of households. While over a longer span, an explanatory 
function could be derived if the biases and errors are random, this is not yet 
possible. The series are short. Major structural upheavals occurred because 
of the postwar housing shortage and then of Korea. On an a priori basis, this 
would indicate that any attempt to find significant simple econometric rela- 
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tionships would probably meet with failure. This proved to be the case. We 
could find no logical significant relationships between changes in the various 
rates and major economic variables. 

However, by inspection, changes in the husband-wife household rate do 
seem to be somewhat related to changes in income and employment. No such 
relationships appear for fluctuations in the rate of other households or of the 
total. On the theory that the lack of significant economic correlation was 
possibly due to the high sampling variability of these series and to the small 
number of observations, we sought a related index with more observations 
and a likelihood of greater reliability. Such a series exists in the estimates of 
total marriages occurring within the United States, as compiled by the 
National Office of Vital Statistics. [12] From these data, we constructed a 
quarterly series for marriages from 1951 through 1959, adjusting for seasonal 
movements and for the change in the number of males entering the mar- 
riageable age brackets. 

This adjusted Vital Statistics series for marriages was correlated with the 
average rate of unemployment for identical quarters. The resulting least 
squares equation is: 


Y = 141.3 — 3.78X r= — .59 
(0.87) N = 34 


Y =the monthly rate of marriages in thousands. 
X =the Census Bureau’s estimate of unemployment as a per cent of the 


civilian labor force. 

The marriage rate has averaged about 125,000 per month. This equation, 
therefore, indicates that for short-run fluctuations, about a three per cent 
decrease in the number of marriages occurs for each one per cent increase in 
unemployment. Since the marriage rate is only one of the factors affecting 
the number of husband-wife households and since deaths continue irrespec- 
tive of unemployment, a drop in the marriage rate will have more than 
twice as large a net effect on married household formation. Consequently, 
this equation would indicate that in the short run, a one per cent increase 
in unemployment would decrease the number of new husband-wife households 
formed in a quarter by 8 to 10 per cent. 

Returning to Table 2 and the column showing fluctuations in the occur- 
rence rate of husband-wife households, we see that movements in the rate 
appear to have the proper signs predicted by the above relationship for 
most years. - 

The data in Tables 1 and 2 seem to give us primarily information which 
can be utilized for qualitative analysis. If we average several years together 
as is done in Tables 3 and 4, the reliability of the indicated movements and 
our ability to explain them by type and cause is improved. Pure sampling 
variability will be a smaller percentage of the indicated change, and there 
is some possibility that the errors and biases will be offsetting. 

Clearly, the Census surveys have greatly increased our knowledge in com- 
parison with past series in which year-to-year changes were primarily esti- 
mated from assumed relationships with other data. Even these previous series 
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form the basis of some of our most widely used econometric models. The new 
data should improve these models greatly. However, whether the improve- 
ment is sufficient to make them useful for the exact estimates in which they 
are frequently employed seems to me a most debatable question. I feel per- 
sonally that not for a long time, if ever, are we likely to get data from 
annual surveys of this size which would enable us to handle econometrically 
with any degree of validity tests of even the most obvious economic variables. 


4. LONG-RUN MOVEMENTS IN HOUSEHOLD FORMATION 


What is the shape of the long-run trends of household formation? What 
do they indicate for the future? Table 3 shows a gradual net increase in the 
number of households formed per year, from a rate of 378,000 between 
1890 and 1910 to 504,000 for 1930-40. Because this increase was less than 
population growth, the per capita rate fell by more than 10 per cent. In the 
postwar period, household formation shot up to an average rate of over 
1,500,000 between 1947 and 1950. In the following years growth declined by 
nearly half. Projections for 1959-1965 show a still lower total. According to 
the projections, however, the period of smaller growth is nearly ended. After 
1961 the number of households formed per year should rise. The Census B 
projection, as shown in Table 3, indicates a growth rate of over 1,000,000 
per year between 1970 and 1975. ‘ 


TABLE 3. GROWTH OF HOUSEHOLDS IN THE UNITED STATES, 
1890-1975 


Annual Change from Previous Period 


Number in Millions Ga Thousands) 
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* Projection based on Census Series B. 

Sources: 1950 Population Census, Report P-E No. 2A Table B; and Census of Population, II: Table 102 (1890— 
1930 division by type are the author's): Current Population Reports: Population Characteristics, P-20, No. 90 and 
No. 94. 1890 and 1910 include some quasi-households. 


Historically, most new households have been of the husband-wife type. 
From 1890 to 1930, about 80 per cent of new households were of this standard 
type. In the present decade, however, new standard households have aver- 
aged only about 56 per cent of the total. In several recent years, new house- 
holds not headed by a married couple have risen to over two-thirds of the 
yearly increments. 
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Apparently a sharp break occurred in 1940 in the basic causal factors at 
work. Table 4 presents the broad sweep of movements by cause from 1890 
to 1959, together with a projection (the Census B estimate). The causes 
shown are the two demographic factors of population and age-sex varia- 
tion, together with the result of shifting occurrence rates from all other 
causes. The separation is again performed by the use of standardization tech- 
niques. 

This table calls attention to one of the more hopeful factors in making 
projections of future household formation. During the past 69 years, over 
four-fifths of household formation has been due to demographic changes. 
For the period from 1890 to 1947 the percentage from demographic sources 
alone was 95 per cent.! Between 1890 and 1940, less than 18,000 households 
per year were formed as a result of changing socio-economic pressures on 
the desire to separate into more or fewer households. The standardized 
household rate increased by somewhat less than 5 per cent during this entire 
50 years. Furthermore, the changes decade by decade were extremely moder- 
ate, falling well within this 5 per cent limit. 

The relative consistency of the occurrence rates, with a great preponder- 
ance of all types of household formation arising from demographic forces, 
was altered sharply by the developments of the war and postwar periods. 
(We consider the period from April 1940 to April 1947 as the war period, 
in order to avoid analyzing the effect of household separations as husbands 
entered and then left the armed services.) During this period, the draft was 
certainly a major social change leading to earlier marriages. It was ac- 
companied by a sharp rise in real incomes and a rapid fall in unemployment. 
A rather severe housing shortage accompanied by rent controls was due to 
the wartime restriction on residential construction. 

Contrary to many impressions, the war period showed no large increase 
in the occurrence rate. The tables show about a 20 per cent rise in the 
average annual increase in number of households between 1940 and 1947 com- 
pared to that of the prewar major depression decade. However, most of this 
was demographic, and the occurrence rate rose only slightly. Only about 74,000 
households a year, or 12 per cent of the total, came from this factor. 

The change in the over-all totals concealed radical movements in what was 
happening to the separate types of households. The most significant shift was 
in marriages, which occurred both at earlier ages and in greater numbers in 
the older age brackets. The standardized rate of married males rose from 67.1 
to 72.5, or roughly by one per cent a year. This shift accounted for over 40 
per cent of the growth in all households. In contrast, the standardized rate of 
nonmarried heads fell. 

The movements of other households reveal an interesting example of a whole 
mythology based on incorrect or misinterpreted facts. In the immediate post- 


1 The method used in Table 4 of multiplying the population increase by the beginning of the period occurrence 
rate, and the change in the household rate due to demographic forces by the total population at the end of period 
to measure the total demographic increase results in a somewhat arbitrary allocation between the demographic and 
nondemographic forces of formations in which the two have interacted. The total increase ascribed to each factor 
would differ somewhat if another method of al!>cation were used. Such a change would not, however, alter the basic 
analytical results. 
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war period, a tremendous outcry arose claiming that rent control had led to 
an inordinate growth of nonmarried heads. It was argued that rent controls 
allowed the nonmarried to spread out into space that should have been allo- 
cated to couples. While higher rents might have led to still further contractions, 
we note from Table 2 that the household rate for both nonmarried males and 
females fell between 1940 and 1947. 

There was a small increase in the actual numbers in these groups due to the 
larger population and its changing structure, but the increase was less than 
10 per cent of what it would have been if occurrence rates had not fallen. If 
only demographic forces had been at work, we would have expected over 35 
per cent, instead of the actual three per cent, of the new households to be 
of the non-couple type. For married heads, the changing age structure for the 
first time caused the total of demographic influences to fall behind the rate of 
increase in the adult population. This negative factor has continued and will 
still be a force decreasing household formation during the next decade. 

The largest increase in households for any period on which we have data 
occurred between 1947 and 1950. New households formed at the rate of over a 
million and a half a year—nearly three times the prewar level. This is also the 
only period in which a shifting occurrence rate outweighed demographic forces. 
Among married couples, the number of new households brought about by the 
changing occurrence rate was nearly 800,000 per year. Not quite half of this 
number was due to the continued fall in age at marriage. The largest share, 
though, arose from a decrease in the number of doubled couples. As the rate 
of housing construction speeded up in the postwar: period, more and more 
families could find their own houses. 

The rapid rise in other households between 1947 and 1950 was due primarily 
to the extremely favorable demographic forces at work. While the occurrence 
rate in this category did rise somewhat, it made much slower gains than that 
for the husband-wife group and accounts for only 18 per cent of new non-couple 
households. 

Between 1950 and 1959, the formation of new households slowed down but 
still remained 55 per cent higher than in the immediate prewar period. The 
differential in household formation was primarily due to gains in the occur- 
rence rates. However, as the period progressed, there was a decided reversal 
in the movements of these rates. These data are shown in Table 2. The rate of. 
married males fluctuated but was the same in 1959 as it had been in 1951. The 
ratio of household heads to the number of couples (the complement of the 
doubling rate) continued to climb slowly until 1955, then stabilized in 1955-8, 
and climbed again in 1959. 

On the other hand, the occurrence rates among the nonmarried continued 
to increase for the entire period. In contrast to rather minimal changes in 
previous periods, this rate rose from 33.9 to 41.0. Nondemographic forces ac- 
counted for slightly over half the new households in this group. However, be- 
cause of the slowing down of increased occurrences in the first two groups, the 
total household rate rose only 1.8 points (four per cent) between 1951 and 1959 
as compared to 3.5 points (eight per cent) in the four previous years. Thus the 
increase retarded from 2.0 per cent a year to about 0.5 per cent a year. 
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5. LONG-RUN PROJECTIONS 


Estimating the degree to which these changes reflect permanent movements 
in the occurrence rates and therefore the probability of their either stabilizing 
near present levels, continuing to gain, or moving back toward previous rates 
is, of course, the major problem for population forecasters. While it appears 
that both economic and social forces have been at work to cause these changes, 
it is extremely difficult if not impossible to show statistically a direct economic 
impact on the trend in rates. 

While many attribute the growth in the occurrence rates to our new economic 
era, this is far from obvious in the data. The economic impact on household 
formation is usually believed to work through either increases in real income 
or decreases in unemployment. If income is an important cause of the post- 
1940 shift, it must form a very discontinuous function, perhaps reaching a 
threshold, exerting considerable pressure, and then losing its influence again. 
From 1890 to 1930, the growth of income was rapid (equal to most projections 
of the future), unemployment was low, but the increase in the standardized 
household-rate was slight (in the order of magnitude of one per cent per decade). 
From 1929 to 1939, real income barely increased (falling on a per capita or 
per household basis), and unemployment was high; yet the household-rate in- 
creased about as much as in previous decades. Finally, since 1947 we have had 
economic growth, but not at unusual rates, while the household-rate has in- 
creased rapidly. [11, p. 1]. However, the exact timing of changes between 1940 
and 1950 may not reflect completely the changes in economic conditions be- 
cause the inability of households to expand more rapidly during the war may 
have caused a lag in growth rates. In addition to the growth of income, the rise 
of pension funds and social security may have made privacy more purchasable, 
while the social pressure for independent living may have increased. 

Another possible economic variable which could have caused shifts in the 
rate of household formation is credit. Again, using annual data, there is no obvi- 
ous correlation between household formation and either the interest rate on 
residential mortgages or the flow of capital into the residential mortgage market. 
It is possible that the increased married-rate and the sharp shift in it since 
1940 could be caused by lower downpayments on houses. Theoretically, less 
doubling might result from lower downpayments. It does not seem likely, 
however, that downpayments cause marriages to be moved up or set back, 
since most newlyweds do not purchase houses. Even with easier credit, most 
young married couples remain primarily tenants, reflecting their youth and 
lower incomes. 

In any case, there does not now appear any way to show statistically the 
impact of credit on household formation. Moreover, if there is any such in- 
fluence, it would by now have exerted most of its pressure toward a higher 
household rate, since both the credit variables and the number of doubled 
couples have come close to their minimum values. 

Since we have not been able to find any statistical relationship between past 
trends and other measurable variables, it seems necessary to forecast the future 
primarily on the basis of an analytical consideration of the changes which have 
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TABLE 5. EFFECT ON HOUSEHOLD FORMATION OF VARIOUS 


ASSUMPTIONS AS TO 1975 OCCURRENCE RATES 


Increase in Households over 1959 (in thousands) 
From Occurrence Rate Assumptions 
Type From 
Demographic At 1950-59 4 , At no change 
Change Census B rate of from 1959 

increase eh rate rate 

Husband-wife 8,550 2,750 3,800 4,000 —_ 

Other 4,000 350 3,900 21,500 —_ 

Total 12,500 3,100 7,700 25,500 — 
Total Increase 1959-75 15,650 20 , 200 38 ,000 12,500 
Total Households in 1975 66,775 74,475 92,275 63 ,625 


Source: Census projection of total and by type from Current Population Reports: Population Characteristics, 
P-20, No. 90. 


been taking place. In this way we can select what appear to be reasonable 
assumptions as to the likelihood of these movements’ continuing or shifting. 
For this purpose let us compare the Census B projection of approximately 
15,640,000 new households to be formed between 1959 and 1975 with other 
logical assumptions [9]. 

Tables 1, 3 and 4 allow us to compare the results of this projection for 
both the married and nonmarried to previous trends. Average household forma- 
tion would be 764,000 a year from 1959 to 1965 and then would rise to 1,040,000 
annually from 1965 to 1970, and to 1,140,000 from 1970 to 1975, or to an 
annual average of 977,000 for the entire period. About 80 per cent of this 
growth is expected to stem from demographic forces. The rate of married males 
shows a considerable increase in comparison with married females (indicating 
that more of the married females will be under 20). The sharpest increase occurs 
among the nonmarried female heads, echoing, according to the assumptions, 
the sharp rise in this group between 1950 and 1959. The total projected change 
in the household rate in the course of 16 years would be from 47.3 to 48.7, or 
less than half the increase from 44.0 to 47.3 actually experienced in the nine 
years from 1950 to 1959. This proceeds primarily from the assumption that 
undoubling of married couples cannot continue much further. 

What other assumptions are possible as to shifting occurrence rates between 
now and 1975, and how would these affect both household formation and the 
number of households expected in 1975? The results of a few possible assump- 
tions which give likely bounds to the problem of forecasts are shown in Table 5. 
The Census presents other assumptions in addition to their B Projection in 
Current Population Reports—Population Characteristics. Series P-20, No. 90. 

In Table 5 we accept the projection that approximately 12,500,000 new 
households would result from demographic increases alone if the 1959 occur- 
rence rates persist. Since between now and 1975 the bulk of household formation 
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from demographic forces will come from aging of the existing population, pro- 
jections on this base have a much narrower range of probable variance than 
have those based on a forecast of occurrence rates. 

Table 5 shows that husband-wife households form the bulk (89%) of the 
new units expected to arise from increased occurrence rates. This presents a 
sharp contrast to this past decade, in which less than half of the additions 
from socio-economic forces occurred in this classification. 

Table 5 also shows that the differences in total household formation which 
can be expected among husband-wife families alone is not large. The projected 
increase of 2,750,000 units from shifting occurrence rates is liberal. Larger in- 
creases would seem more and more unlikely, since the projection already as- 
sumes that doubled couples will fall to 1.7 per cent of the total and that the 
age of women at first marriage will be 20. For either the occurrence rates to 
continue to fall as they have in this decade or for the extremely high assump- 
tion to hold, doubling would have to fall to 1.0 per cent or the age at first 
marriage would have to decline to somewhat below 20. 

Neither event appears likely. The ratio of doubled couples to the total is 
already very low. The social reasons for living with parents, in quasi-house- 
holds, etc. should establish some floor below which doubling will not fall, no 
matter how favorable the economic conditions. While a decrease in median age 
at marriage te the lower level of some other countries is possible, it would 
seem to be hindered by our educational system. The gradual approach of these 
factors to a limit may account for the relative stability of the husband-wife 
occurrence rate in recent years. 

At the opposite extreme, it might be argued that an assumption that the 
occurrence rate will not fall below existing levels, with a consequent decline 
in household formation, is also too liberal for a lower bound. It has been guessed 
that the Great Depression caused the household occurrence rate to fall by sev- 
eral per cent. A similar collapse might again cause the loss of three or four mil- 
lion expected households. The main reasons for selecting stability rather than a 
decrease as the most pessimistic limit are the facts that the rate has gone up at 
least slightly in every decade including that of the. 1930’s, and that the proba- 
bilities of as extended and deep a depression as that of the 1930’s are believed 
to be slight. 

Table 5 also indicates that the possibility of error appears much greater 
among other households than for the husband-wife type. Previous tables have 
shown both erratic movements in the rates for the “other” category as well as 
rapid increases in the past decade. The Census B projection assumes that the 
increase in this category in the next 16 years will be less than 10 per cent that 
of the past decade. This sharp retardation is faster than was actually planned 
in the projections. It results from a faster increase than was assumed for the 
past three years. 

In this category, it is possible for recent rates of increase to continue and 
even to expand rapidly. What if the demand for privacy and the effect of social 
change leads to a continued rapid rise of nonmarried individuals as heads of 
their own households? The absolute maximum number would result if all per- 
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sons 20 and over, other than wives or inmates of institutions, were to head their 
own households. Such an extremely unlikely situation would mean more than 
a doubling of expected household formation. 

Although this assumption is implausible, it does enable us to see how much 
movement could occur among these household types and to become aware of 
the relative magnitudes involved. It also underlines the possible nullifying 
effect on household formation of a rise in the rate of marriages. If all adults were 
married, the maximum household-rate would be 50. But if the per cent of adults 
married stays at the projected 74 per cent, then the total household-rate could 
reach 63. 

If instead of the extreme case, with every potential household formed, the 
rates were to continue to grow only as they have in this decade, the formation 
of other types of households would still be 90 per cent higher than shown in the 
Census B projections. 

It is for this reason that, at the moment, some assumption between that of 
the Census B and the maximum would appear most reasonable. While we have 
seen that no short-run correlation between the occurrence rate for these other 
households and economic variables could be found, I should judge that the 
rapid increase in this rate in the past decade was probably related to prosperity 
and the consequent social changes which have occurred. Thus if a full-employ- 
ment economy prevails, a continuation of this trend might be expected. 

The data throughout the tables gives the analyst a basis on which to evaiuate 
these projections and upon which to select the figures he desires for his own 
forecasts. Those who look at past figures and see comparative stability in occur- 
rence rates from 1899 to 1947 are likely to judge the B projection of an average 
increase of 953,000 in household formation to be about 20 per cent too high. 
Those who convince themselves that the war brought about a basic change in 
social mores which will continue to stimulate the desire for independent house- 
holds, but not quite as fast as in the recent past, will accept the B projection as 
a basis for guidance. Those burned by past underestimates may feel that the 
rate of increase will continue to grow, particularly among nonmarried house- 
holds. In this case they might decide to raise the B projection by 10 to 15 per 
cent. 
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A STATISTICAL INVESTIGATION OF THE 
INDUSTRIALIZATION CONTROVERSY* 


STEPHEN SPIEGELGLAS 
Northwestern University 


Whether trade of the industrial countries with the non-industrial 
countries declines in an industrializing world has always been a con- 
troversial question. The present paper examines this controversy with 
reference to events in the postwar period. The validity of the findings 
is tested under alternative classifications of an industrial area and in 
terms of current value and deflated value figures. Recent changes in the 
commodity composition cf exports of both areas are then noted. The 
main finding is that the progressing industrialization in the postwar 
years did not unfavorably affect the volume of exports of the industrial 
area, although it did change, to some extent, the direction and com- 
position of the international commodity flows. However, certain condi- 
tions in the postwar period do not allow us to assume that we can re- 
solve the controversy in question. 


ECENT pre-occupation with the problem of economic development is con- 
R stantly bringing to the fore an issue which, for quite some time now, has 
been characterized by its highly controversial nature. That issue is the impact 
of industrialization of the underdeveloped countries on the international di- 
vision of labor, the exports of the industrial countries, the extent of world trade, 
and the commodity composition of that trade. 

It has been contended that industrialization has a disrupting effect on inter- 
national trade. Others have equally strongly maintained that the long-term 
prospects for world trade are, in spite of the industrialization of the less eco- 
nomically developed areas, as favorable as ever. 

The arguments of the pessimists are two-fold: (a) industrialization of the 
underdeveloped countries “makes it more and more difficult”! for the economi- 
cally more advanced countries to sell their exports and this leads to a declining 
volume of world trade as well as a declining share of those countries in that 
trade,? and (b) industrialization “seems invariably to require import barriers 
for the protection of infant industries”* and this tends to diminish the volume 
of world trade, too. 

Supporting evidence is found by these economists in the fact that since the 
1880’s the volume of world trade in both primary products and manufactured 


* The author is grateful to Professor R. W. Clower for valuable comment and very helpful suggestions. Any 
errors which might appear, however, are mine. 

1G. D. A. MacDougall, “Britain's Foreign Trade Problem,” Economic Journal, Vol. LVII, No. 225 (March 
1947), p. 80. The contention of MacDougall is that “nearly the whole fall in the volume of (British) exports between 
1913 and 1937 can be accounted for by the halving of (British) trade to the industrialized and industrializing 
countries of the world” (ibid., p. 81). 

?: A. J. Brown made the following comment on this point: “It does not follow that the old industrial countries’ 
proportionate share in the absolute expansion of world trade during a widespread industrialization of countries 
hitherto specialized on primary production would be same as their initial proportionate share of world trade, because 
such an industrialization would certainly alter the pattern of world trade very considerably” (Industrialization and 
Trade, The Royal Institute of International Affairs, London, 1943, p. 56). 

* This point was raised by L. W. Towle (International Trade and Commercial Policy, Harper & Bros., 1956, 
p. 859), but he does not seem to subscribe to this point of view at all, as can be judged from the reference to 
his book in footnote 7. 
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goods did not rise as much as world manufacturing production,‘ that the volume 
of world trade in manufactures “has increased less rapidly than trade as a 
whole, and much less rapidly than trade in primary products,”® and that “im- 
ports of manufactures from all sources have fallen from about two-fifths to 
less than one-quarter of consumption.’’ 

The optimists, on the other hand, stress (a) the expansion of world income 
resulting from the industrialization and the accompanying features of higher 
levels of consumption and higher standards of living,’ and maintain (b) that 
foreign trade “grows to be not a smaller but a larger part” * of the developing 
countries’ economies, particularly in the initial stages of their rapid develop- 
ment, and that, therefore, the effect on the industrial countries 1s by no means 
adverse.® They admit, though, that “while normally the import from the older 
industrial countries is not likely to decline as a result of industrial growth else- 
where, these imports are likely to change in character, for the countries in which 
industry develops will diversify their demand for consumption goods and in- 
crease their demand for manufactured capital goods.’’!” 

The present study is a statistical investigation undertaken to evaluate the 
most recently available data as they bear on the industrialization controversy. 
It is essentially a short-run analysis, inasmuch as it deals only with the changes 
in world trade patterns since World War II. No attempt is made to fit those 
patterns into longer-run trends as studied by several writers," nor are the find- 
ings of this study intended as forecasts of likely future tendencies. 

Main attention is given to the changes in the proportion of world trade 
taking place between the so-called industrial and non-industrial areas, and 
particularly to the change in the relative share of world trade accounted for 


¢ Folke Hilgerdt in his ground-breaking study on this subject matter (Industrialization and Foreign’ Trade 
League of Nations, 1945) has shown that the world’s manufacturing output rose 640 per cent between the periods 
1876-80 and 1936-38, while in the same period of time the volume of world trade in primary products rose 277 
per cent and that in manufactures rose only 188 per cent (Table on p. 14). Along the same lines lies the following 
comment by G. D. A. MacDougall: “Between 1913 and 1937 the world’s output of fact doubled, but 
trade in manufactures failed to rise” (op. cit., p. 82). On the other hand, the following statement by E. Staley seems 
to imply that the relationship between these two series was often cyclically determined: “In the years 1924-29 the 
correspondence between the growth in world industrial production and growth in the quantum of world trade was 
close. In the depression trade fell with production, only slightly less violently. Thereafter it failed to keep pace in 
recovery and although the quantum of world trade rose in 1937 almost to the 1929 level, it did not move into new 
higher levels with production” (World Economic Development, International Labor Organization, Series B, No. 36, 
Montreal, 1944, p. 134). 

5 Quoted from P. T. Elsworth, The International Economy (Macmillan, 1950), p. 851 and based on O.E.E.C., 
Interim Report on the Eurcpean Recovery Program, Vol. I. 

*G. D. A. MacDougall, The World Dollar Problem, (St. Martin’s Press, 1957), p. 209. The author interprets 
this tendency as an indication of the increasing self-sufficiency of all nations in the world. 

7 For an expression of this point of view see: P. T. Elisworth, op. cit., p. 849; L. W. Towle, op. cit., p. 17; and 
C. P. Kindleberger, International Economics (Irwin, 1953), p. 392. 

8 United Nations, Measures for the Economic Devclopment of Under-Developed Countries, (Department of Eco- © 
nomic Affairs, May 1951), . 58. 

® This is what F. Hilgerdt had to say on this point: “These younger countries must as a rule pay for their in- 
creased requirements of foreign primary products through the export of manufactured articles. At first sight, it 
might appear that the resulting increase in the exchange between these countries and primary producing areas 
would tend to affect the older industrial countries adversely; but closer analysis reveals that such a conclusion is 
untenable” (op. cit., p. 107). 

10 F. Hilgerdt, op. cit., p. 117. A similar statement can be found on page 135 in E. Staley’s study mentioned 
previously in footnote 4. 

1 The most thorough study of long-run tendencies was that by F. Hilgerdt which covered the period from 1870 
to 1938. Unfortunately, the data used in the two studies do not allow a useful linking of their results. 
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by the exports of the industrial area to the non-industrial area between 1938" 
and 1953-6. The appropriateness of the findings is then investigated under alter- 
native classifications of countries as well as in terms of current value and de- 
flated value figures. Finally, attention is paid to the changes in the commodity 
composition of exports of both areas. 


1, TRADE BETWEEN AND WITHIN INDUSTRIAL AND NON-INDUSTRIAL AREAS 


This study utilizes the G.A.T.T.’s definition of an industrial" area, but it 
transfers six countries, namely, Denmark, Iceland, Ireland, Greece, Portugal, 
and Turkey from the <ndustrial into the non-industrial group.“ Accordingly, 
the industrial area is represented in this paper by the following thirteen coun- 
tries: Austria, Belgium and Luxembourg, Canada, France, Federal Republic 
of Germany, Italy, Japan, the Netherlands, Norway, Sweden, Switzerland, 
the United Kingdom, and the United States. All the remaining countries in 
the world form the non-industrial area. 

Given this distinction, the inter-area and intra-area commodity flows can 
now be studied. 

Table 1 shows that the value of world exports increased from about 22 
billion dollars in 1938 to about 95 billion dollars in 1956, or by approximately 
330 per cent. By far the greater increase is noted regarding exports of the in- 
dustrial countries which amounted to 357 per cent. In contrast, the exports of 
the non-industrial countries rose by only 285 per cent. As a result, the propor- 
tion of total exports accounted for by the industrial area rose from 59.8 per cent 
of total trade in 1938 to 63.9 per cent in 1956. But, as Table 1 reveals, this 
proportion was as low as 56.8 per cent in the immediate postwar period, when 
due to war-time destruction many of the industrial countries were unable to 
produce and to export a large number of commodities. Since then, however, 
there was a steady rise in exports of those countries. 

As far as imports of these two areas are concerned, Table 1 shows that those 
of the non-industrial area increased, between 1938 and ‘1956, proportionately 
somewhat more than those of the industrial area; the former by approximately 
335 per cent and the latter by 320 per cent. As a result of this, the share of the 
non-industrial area in world imports rose from 39.6 per cent in 1938 to 40.2 
per cent in 1956, while the share of the industrial area fell correspondingly. 


12 It could be argued that the selection of 1938 as the base year is unsuitable because it was a year of business 
contraction in most countries of the world and that the period 1937-8, as cycle average, should have been chosen 
instead. However, 1938 was selected as the base year for two reasons. First, the United Nations’ Direction of Inter- 
national Trade, the main source containing comparable data for the prewar and postwar periods, used 1938 to 
represent the former period, and second, a careful investigation of the 1937 and 1938 patterns of world trade, on 
the basis of League of Nations documents, indicated very similar percentages for the intra-area and inter-area 
flows, with which this paper is concerned, for both of those years. 

18 A better term would have probably been “predominantly industrial” instead of just “industrial” with respect 
to an area, but this writer decided to follow the G.A.T.T. terminology. Also, the term “rest of the world” would 
appear to have been more appropriate than “non-industrial area.” It should be pointed out that the selection of an 
industrial area does not, of course, imply that the countries composing it do not export several types of primary 
products. On the contrary, some countries, such as for example the United States, are important producers and 
exporters of agricultural products. (This remark is prompted by Sir 8. Caine’s critical comment on R. Nurkse’s 
similar classification of countries in his leading article, entitled “Trade Fluctuations and Buffer Policies of Low- 
Income Countries,” in a symposium on “The Quest for a Stabilization Policy in Primary Producing Countries,” 
Kyklos, Vol. XI, 1958, No. 2, p. 187). 

4 This was done because the exports of manufactures of each of these six countries represent only a relatively 
small percentage of their total exports. The minimum accepted was 47 per cent in the case of Canada. Admittedly, 
this criterion is entirely arbitrary. 
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TABLE 1. TRADE BETWEEN AND WITHIN THE INDUSTRIAL* AND 
NON-INDUSTRIAL AREAS, 1938, 1948, AND 1953-6 


(in millions of dollars) 


Exports to Industrial Area Non-Industrial Area Total World 


Exports from Value Value % Value % 


Industrial Area 6 ,860 6,450 29.0 13 ,310 
Non-Industrial Area 6,597 2,366 8,963 


Total world 13 ,457 8,816 J 22,273 


Industrial Area 15,417 16 ,339 31,756 
Non-Industrial Area 14,760 R 9,414 : 24,174 


Total world 30,177 < 25,753 ‘ 55,930 


Industrial Area 22,996 23 ,666 46 ,662 
Non-Industrial Area 20,330 8,661 4 28,991 


Total world 43 ,326 ‘ 32 ,327 . 75 ,653 


Industrial Area 24,425 24,387 : 48,812 
Non-Industrial Aare 20 ,666 3 8,985 29,651 


Total world 45,091 33 ,372 78,463 


Industrial Area 28 ,490 3. 24,686 53,176 
Non-Industrial Area 22,139 10 ,283 32,422 


Total world 50 ,629 34,969 85,598 


Industrial Area 33,161 27,728 ° 60 ,889 
Non-Industrial Area 23 ,896 10,618 ‘ 34,514 


Total world 57 ,057 38 ,346 95,403 


Industrial Area 27 ,268 25,117 52,385 
Non-Industrial Area 21,757 9 ,637 31,394 


Total world 49 ,025 58.5 34,754 : 83,779 100.0 


* No adjustment was made for territorial changes. However, this should not affect the results to any appre- 
ciable extent, if at all. 


Source: United Nations, Direction of International Trade 1956, Series T, Vol. VIII, No. 7. 


The outcome of all these developments was that the non-industrial group 
had a negative net balance of trade in 1956 and in other postwar years, as com- 
pared to a positive balance in 1938. 

We turn now to the exports within and between the two groups of countries. 
Table 1 makes it quite clear that all four major trade flows increased sub- 
stantially since 1938 in both absolute and relative terms. This is shown in 
greater detail below: 


Aver. Amount 
From Area To Area 1938 1953-6 of : er cent 
Exports Exports | Increase — 


Industrial Industrial 6,860 27 , 268 20,408 
Non-Industrial Industrial 6,597 21,758 15,161 
Industrial Non-Industrial 6,450 25,117 18 ,667 
Non-Industrial Non-Industrial 2,366 9,637 7,271 


‘ 4 
1938 40.2 
100.0 
56.8 
1948 43.2 
100.0 
61.7 
1953 38.3 
100.0 
1954 37.8 
100.0 
62.1 
1955 37.9 
100.0 
1956 36.1 
100.0 
te 
1953-6 37.5 

297 

230 

289 

307 
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As far as the absolute changes are concerned, one notices that the industrial 
countries have expanded their trade with each other most, while the trade 
within the non-industrial area increased least and still remains the smallest 
proportion of world trade. 

With regard to percentage changes, it is found that exports both within the 
industrial and non-industrial areas and from the industrial to the non-industrial 
area rose by approximately the same proportion, while the exports from the 
non-industrial to the industrial area lagged behind. This fact leads, no doubt, 
to the most important observation which can be made on the basis of Table 1, 
namely, that there was an almost steady decline in the relative importance of 
the total value of trade taking place between the industrial and non-industrial 
areas in the postwar period; the share of that trade having declined from 58.6 
per cent of total trade in 1938 to 55.6% in 1948, 58.2% in 1953, 57.4% in 1954, 
54.7% in 1955, and 54.1% in 1956." 

This decline, it is essential to stress, was entirely due to the relative decrease 
in exports from the non-industrial to the industrial area, whereas the proportion 
of world trade accounted for by the imports of the non-industrial from the in- 
dustrial area remained virtually unchanged. 

It is definitely desirable to verify this conclusion (as well as other observa- 
tions) on the basis of alternative groupings of countries forming the industrial 
and non-industrial areas, since it is possible that a transfer of one or more 
countries from one area into another may affect the findings quite considerably. 
For example, one could criticize the selection of countries forming the industrial 
area in this study by pointing out that the inclusion of Canada, the least in- 
dustrialized among the thirteen countries chosen, is likely to exaggerate the 
extent of trade taking place within that area, on account of the conditions pe- 
culiar to U.S.-Canadian trade. 

Also, one could argue that it is necessary to exclude the trade figures of the 
Soviet Bloc'* countries and of mainland China from the figures in Table 1 on the 
grounds (a) that this might quite seriously affect the prewar versus postwar 
comparisons, and (b) that the lack of complete data on intra-Soviet Bloc trade 
and the trade of that Bloe with mainland China certainly does disadvantageously 
affect the proportion of trade taking place within the so-called non-industrial 
area, however defined. 

In addition, it was found desirable to form an industrial area consisting of 
the nineteen G.A.T.T. countries on the basis of the above-mentioned United 
Nations publication, in order to determine the extent of the differences in re- 
sults between this and the G.A.T.T. study, arising from the utilization of 
alternative sources of data.!” 

The results of all these recomputations are shown in Table 2. 


% According to the Institut far Weltwirtschaft und Seerverkehr the comparable figures for some selected years 
prior to World War II were as follows: 58.8% for 1913, 62.2% for 1925, and 62.6% for 1929 (“Die Aussenhandelsent- 
wicklung und das Problem der deutschen Ausfuhrpolitik,” Weltwirtschaftliches Archiv, Vol. XXXVI, July 1932, 
p. 34). Unfortunately, both sets of figures cannot be directly compared with one another on account of differences 
in the classification of countries. 

% The countries composing the Soviet Bloc, as understood in this context, are: Albania, Bulgaria, Czecho- 
slovakia, East Germany, Hungary, Poland, Rumania, and the U.S. 8. R. 

17 The G.A.T.T. figures originated from various sources (see the appendices to International Trade 1956, pp. 
259 and 261, and Trends in International Trade, Geneva, Oct. 1958, p. 129). 
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TABLE 2. TRADE BETWEEN AND WITHIN THE INDUSTRIAL AND 
NON-INDUSTRIAL AREAS, ACCORDING TO ALTERNATIVE 
CLASSIFICATIONS OF COUNTRIES, 1938, 1948, AND 1953-6 


(in millions of dollars) 


Exports to Industrial Non-Industrial 


hoes Total World 


Exports from | Value % | Value % Value % 


Industrial Area consisting of 12 countries 


Aven | 5,506 24. 6,921 31. 12,427 


1 
Non-Industrial Area 7,245 32. 2,601 11.7 9,846 


Total world 12,751 J 9,522 8 22 ,273 


Industrial Area 10,572 ; 17,986 32.2 28,558 
Non-Industrial Area 17 ,023 ; 10,349 5 27,372 


Total world 27 ,595 , 28 .335 ‘ 55 ,930 


Industrial Area 15,723 26 ,698 42,421 
Non-Industrial Area 23 ,663 : 9,569 33 , 232 


Total world 39 ,386 : 36 , 267 : 75 ,653 


Industrial Area 17 ,560 ; 27,198 ‘ 44,758 
Non-Industrial Area 23 ,895 9,810 33,705 


Total world 41,455 37 ,008 78 ,463 


Industrial Area 20,745 ‘ 28,021 . 48 ,766 
Non-Industrial Area 25 , 663 11,169 13. 36 , 832 


Total world 46 , 408 39,190 85,598 


Industrial Area 23,981 31,961 55,942 
Non-Industrial Area 27 ,808 11,653 39,461 


Total world 51,789 43 ,614 95 , 403 


Industrial Area 19 ,502 : 28,470 j 47 ,972 
Non-Industrial Area 25 , 257 10,550 35, 807 


Total world 44,759 39 ,020 83,779 


World totals without the Soviet Bloc and mainland China 


Industrial Area 6,860 34.1 5,376 12,236 
Non-Industrial Area 5,691 28.3 2,189 : 7,880 


Total world 12,551 ‘ 7,565 20,116 


Industrial Area 15,417 ‘ 15,261 P 30 ,678 
Non-Industrial Area 13 ,559 7,200 é 20,794 


Total world 28 ,976 . 22,496 F 51,472 


Year 
1938 44.2 
100.0 
| | 
51.1 
1948 48.9 
100.0 
56.1 
1953 43.9 
100.0 
57.1 
1954 42.9 
100.0 
56.9 
1955 43.1 
100.0 
58.6 
1956 41.4 
100.0 
57.3 
1953-6 42.7 
100.0 
60.8 
1938 39.2 
100.0 
59.6 
1948 | 40.4 
100.0 
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Exports to Industrial Non-Industrial 


Total World 


Exports from Value % Value % Value % 


Industrial Area 22,996 31.2 23,079 31. 46,075 62. 
Non-Industrial Area 19,575 26.6 8,014 10. 27,589 37. 


Total world 42,571 57. 31,093 42. 73,664 100. 


Industrial Area 24,425 32. 23,709 , 48,134 63. 
Non-Industrial Area 19,839 26.: 7,555 27,394 


Total world 44,264 31,264 75,528 100. 


Industrial Area 28,490 23 ,838 52,328 63. 
Non-Industrial Area 21,027 8,466 29,493 36. 


Total world 49,517 32,304 81,821 


Industrial Area 33,161 ‘ 26,485 28. 59 ,646 
Non-Industrial Area 22,600 9,469 32,069 


Total world 55,761 60. 39.5 91,715 


Industrial Area 27 , 268 ; 30. 51,546 
Non-Industrial Area 20 ,760 : 29,136 


Total world 48,028 59. ; 80 ,682 


Industrial Area consisting of 19 countries 


Industrial Area 8,148 . 5,883 26.4 14,031 63.0 
Non-Industrial Area 6,121 ; 2,124 9.5 8,242 37. 


Totai world 14,269 ‘ 8,004 35. 22,273 100. 


Industrial Area 18,044 15,003 26. 33,047 59. 
WNon-Industrial Area 14,251 8,632 15. 22,883 40.¢ 


Total world 32,295 23,635 42. 55 ,930 


Industrial Area 26,475 ‘ 22,190 29. 48 ,665 
Non-Industrial Area 19,334 10. 26,988 


Total world 45,809 39. 75,653 


Industrial Area 28,081 29. 50,889 
Non-Industrial Area 19,732 ; 2 10. 27 ,574 


Total world 47 ,813 : 39. 78,463 


Industrial Area 22,917 55,375 
Non-Industrial Area ’ 9,141 : 30 , 223 


Total world 32,058 85,598 


‘ 

— 

5 
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1954 

100.0 

65.1 

ee 1956 34.5 

100.0 

1953-6 36.1 

1938 

100.0 

64.3 

1953 | 35.7 

100.0 

64.9 

1954 35.1, 

100.0 

1955 35.3 

100.0 
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Exports to Industrial Non-Industrial 


puns Total World 


Exports from Value % Value % Value % 


Industrial Area 37,203 39.0 25,955 27.2 63,158 
Non-Industrial Area 22,807 23.9 9,438 9.9 32,245 


Total world 60 ,010 9 35,393 95,403 


Industrial Area 31,054 23 , 467 54,521 
Non-Industrial Area 20,739 PY 8,519 ‘ 29 ,258 


Total world 51,793 8 31,986 38.2 83,779 


G.A.T.T.’s Industrial Area consisting of 19 countries 


Industrial Area 8,260 38.6 6,052 28.3 14,312 
Non-Industrial Area 5,342 24.9 1,763 8.2 7,105 


Total world 13,602 63. 7,815 36. 21,417 


Industrial Area 18,071 34. 14,952 33,023 
Non-Incdustrial Area 13,170 24. 6,933 13. 20,103 


Total world 31,241 21,885 53 , 126 


Industrial Area 26,821 17,905 44,726 
Non-Industrial Area 18,538 ‘ 7,084 F 25 ,622 


Total world 45 ,359 24.989 70,348 


Industrial Area 28,106 19,561 47 ,667 
Non-Industrial Area 18,480 7,814 26 , 294 


Total world 46,586 63. 27,375 37. 73,961 


Industrial Area 32,363 39. 21,081 25. 53,444 
9,6 8 


Non-Industrial Area ,543 : 28,221 


Total world 52,041 29 ,624 81,665 


Industrial Area 37,409 . 23,376 25. 60,785 
Non-Industrial Area 20 ,912 8,806 29,718 


Total world 58 ,321 32,182 : 90 ,503 


Industrial Area 31,175 : 20,481 : 51,656 65.3 
Non-Industrial Area 19,402 8,062 2 27,464 34.7 


Total world 50,577 28 ,543 79,120 100.0 


Sources: United Nations, Direction of International Trade 1956, Series T, Vol. VIII, No. 7, and G.A.T.T., 
Trends in International Trade, Geneva, Oct. 1958, Appendix Table A, opposite page 130, 


66.2 
1956 | 33.8 
100.0 
65.1 
1953-6 34.9 
100.0 

66.9 
1938 33.1 
100.0 
62.1 
1948 37.9 
100.0 
63.6 
1953 | 36.4 
100.0 
64.4 
1954 35.6 
100.0 
65.4 
1955 34.6 
100.) 
67.2 
1956 32.8 
100.0 
1053-6 | 
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Table 2 discloses the importance of the classification of countries. The extent 
of the differences cannot be discussed in greater detail here for lack of space, 
but some idea is obtained from comparing the percentages of increase (between 
1938 and 1953-6) given for each classification below. 


Exports from: 


Classification employed Ind. Area| Ind. Area |Non-Ind. Area|Non-Ind. Area 
to to to to 
Ind. Area |Non-Ind. Area} Ind. Area |Non-Ind. Area 


(A) Industrial Area consisting 
of 13 countries 297% 289% 230% 307% 
(B) Industrial Area consisting 
of 12 countries 254% 311% 249% 306% 
(C) Asin (A), but world totals 
without the Soviet Bloc & 
China 297% 352% 265% 283% 
(D) Industrial Area consisting 
of 19 countries 281% 299% 239% 302% 
(E) G.A.T.T.’s Industrial Area 
consisting of 19 countries 277% 238% 263% 357% 


Since we are concerned primarily with the changing direction of world trade, 
it is important to note that in the following summary table each of the five 
classifications listed above has in common the previously observed decline in 
the share of inter-area trade. 


Trade between the industrial and non-industrial areas 


as percentage of world trade in: 
Classification employed 


1938 


| 
| 
| 


It is possible to criticize the general observations made previously on yet a 
different ground by arguing that the presentation of the statistical evidence 
was carried exclusively in terms of value and that deductions from volume 
figures might tell a different story. Table 3 reveals that this is not the case. 

The volume of world exports rose in the postwar period less than the unit 
value. If prices of exported commodities would not have risen at all, the average 
volume (and value) of world exports would have amounted in the period 1953-6 
to only approximately 33.7 billion dollars, rather than 83.8 billion dollars, as 
shown in Table 1. 

Most pronounced was the increase in prices of exports of the non-industrial 
area which more than tripled between 1938 and 1953-6. This in itself accounts 
for most of the increase in the value of exports of this area. Without the price 


(A) 58.6% 54.1% 

AL (B) 63.6 | 62.7 

(C) | 55.0 | 53.5 

Wet (D) 53.9 51.1 

ba (E) | 53.2 | 48.9 
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TABLE 3. INDICES OF VALUE, UNIT VALUE, AND VOLUME OF EXPORTS 
AND IMPORTS OF THE INDUSTRIAL AND NON-INDUSTRIAL AREAS, 
1956 AND 1953-6 


(1938 = 100) 


Category* and years Value Unit Value 


Exports from industrial countries to both in- 
dustrial and non-industrial countries 
1953-6 394 
1956 457 
Exports from non-industrial countries to both 
industrial and non-industrial countries 
1953-6 350 
1956 385 
Imports of industrial countries from both in- 
dustrial and non-industrial countries 
1953-6 364 
1956 424 163 
Imports of non-industrial countries from both 
industrial and non-industrial countries 
1953-6 394 159 
1956 435 174 
World exports (or imports) 
1953-6 376 151 
1956 428 170 


* It should be noted that all export and import categories include both inter-area as well as intra-aiea trade. 
In consequence, the exports of any given area are not equal to the imports of that particular area. 


Sources: Value of exports: Table 1 
Unit value of exports: United Nations, Monthly Bulletin of Statistics, Nov. 1958, Special Table C, 
p. XVI. 
All unit value and volume indices of imports are my own derivations on the basis of theabove mentioned 
sources, 


rise, the volume (and value) of exports would have risen from 9 to only 10 billion 
dollars, rather than to 31.4 billion dollars. On the other hand, the rise in the 
value of exports of the industrial area can be attributed almost equally to 
rising volume and prices. 

With respect to imports, Table 3 shows that the rise in their unit value was 
much more pronounced than the rise in their volume in the period 1938-56, 
for both the industrial and non-industrial areas. And whereas the price rise was 
somewhat greater for the industrial area, the increase in volume was greater 
for the non-industrial area. 

The net effect of all these changes (on the export as well as on the import 
side) on the trade balances of both areas was already previously mentioned. It 
suffices to add that the large shift in that balance in disfactor of the non- 
industrial countries was due not to the terms of trade, which were favorable'® 


18 This generalization certainly does not apply to all non-industrial or, for that matter, industrial countries. 
For instance, the terms of trade of the United States declined in the period 1937-56 from 131 to 98, while at the 
same time the terms of trade of Canada remained practically unchanged and those of Japan rose from 91 to 104. 
(For changes in other countries’ terms of trade since the prewar years, see International Monetary Fund, Inter- 
national Financial Statistics.) 
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to them as compared with the prewar years, but solely to an increase in the 
volume of their imports in comparison to that of their exports. 

Another important conclusion, crucial for our analysis, can also be drawn. 
It can be shown, namely by deflating the figures given in Table 1 by the unit 
value indices given in Table 3, that the decline in the proportion of world trade 
taking place between the industrial and non-industrial areas is also demon- 
strable on the basis of volume figures—a decline from 58.6% in 1938 to 52.5% 
in 1956—and not only on the basis of value figures. 


TABLE 4. TRADE BETWEEN AND WITHIN THE INDUSTRIAL AND 
NON-INDUSTRIAL AREAS, BY MAJOR COMMODITY GROUPS, 
1938 AND 1952-5 


(in billions of dollars) 


Industrial Non-Industrial 
Area Area World 
Exports from Exports to i 
ite? 2, Value % Value % Value % 


Industrial Area Primary Products -90 
Manufactures 2 


8. 
4. 
Non-Industrial Area | Primary Products 
Manufactures 


World Primary Products 
Manufaetures 


Industrial Area 
Non-Industrial Area 


Industrial Area Primary Products 14.26 


Non-Industrial Area | Primary Products 24.23 


World Primary Products 9.77 38.49 4 
Manufactures 19.15 35.06 -6 


44.63 ° 28.92 : 73.55 100.0 


Industrial Area 24.08 A 20.95 45.03 61.3 
| Non-Industrial Area 20.55 d 7.97 ‘ 28.52 38.7 


| 44.64 28.92 73.55 100.0 


* Inasmuch as sources differ, the figures given in this table do not completely agree with the corresponding 
figures given in Table 1. In addition, it'was impossible to obtain comparable data for 1956 and it is therefore that 
the period 1952-5 was chosen. 

Source: G.A.T.T., International Trade 1952, Table II, p. 117 (for 1938); International Trade 1954, Table 10, 
p. 33 (for 1952); International Trade 1955, Table 13, p. 32 (for 1953); and International Trade 1956, Table 16, p. 45 
(for 1954 and 1955). All the figures were adjusted to account for an industrial area consisting of only 13 countries. 
The adjustments were made on the basis of: League of Nations, Economic Intelligence Service, International Trade 
Statistics 1938, Geneva 1939 (for 1938), United Nations, Direction of International Trade, Vol. VII, No. 6 and Com- 
modity Trade Statistics (for 1952-5). 


ds 
Year 
*e 1938 6 | 2.74 12.4 | 4.64 21.0 
2 3.44 15.6 8.77 39.8 
ot 9 | 2.09 9.5 | 8.45 38.4 
IE: 0.18 0.8 | 0.18 0.8 
ae 8.26 37.5 4.83 21.9 | 13.09 59.4 
4 > 5.33 24.2 3.62 16.4 8.95 40.6 
e 13.59 61.7 8.45 38.3 | 22.04 100.0 
7.23 32.8 6.18 28.0 | 13.41 60.8* 
6.36 28.9 2.27 10.3 8.63 39.2 
Pe 13.59 61.7 8.45 38.03 | 22.04 100.0 
41.8 
32.9 

| 
| 
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2, CHANGING COMMODITY COMPOSITION OF EXPORTS OF BOTH AREAS 


We turn our attention now to the important consideration of the commodity 
composition of exports and imports of the industrial and non-industrial areas. 

It should be pointed out at the outset, however, that it is not intended to 
present here a lengthy description of the relative rises, or declines, in im- 
portance of specific commodities or commodity groups, because this topic was 
exhaustibly covered in a number of postwar treatises,!® and because this level of 
commodity disaggregation is quite unnecessary in a study which purports to 
provide only general tendencies dealing with the major trade flows. 

Instead, the very broad classificavion of commodities into the two basic 
categories, namely, primary products and manufactures, is adopted, which 
provides the basis for Table 4. 

The following conclusions are reached: 


(1) The exports of primary products represented, in 1938, 34 per cent of all ex- 
ports of the industrial area and 98 per cent of all exports of the non-industrial 
area. These percentage figures dropped to 31 and 85, respectively, in the period 
1952-5. It follows that there was a corresponding increase in the proportion 
which manufactured products occupied among total exports of both areas.?° It 
is important to note, however, that the exports of primary products still repre- 
sented in the more recent period a larger proportion of world trade than manu- 
factures, in terms of value. 


(2) The exports of primary products of the industrial area represented before 


and after the war approximately 35 per cent of world exports of those com- 
modities, whereas the non-industrial area accounted for approximately 65 per 
cent in each period. On the other hand, the share of the industrial countries 
in world exports of manufactured products declined from a virtual monopoly 
position before the war to somewhere in the range from 85 to 90 per cent. 


(3) The greatest absolute increases occurred in (a) exports of manufactures 
from the industrial to the non-industrial area and (b) exports of primary 
products from the non-industrial to the industrial area. Very interesting fea- 
tures of these increases were the profound changes in the commodity composi- 
tion of both flows. As regards exports from the industrial area, there was a very 
pronounced shift from textiles and miscellaneous manufactures to industrial 
equipment, transportation equipment, and chemicals. As far as exports from 
the non-industrial area are concerned, the most noticeable change in the com- 


19H, Tyszynski, “World Trade in Manufactured Commodities, 1899-1950,” Manchester School, Sept. 1951; 
W. A. Lewis, “World Production, Prices and Trade, 1870-1960,” Manchester School, May 1952; A. K. Cairncross 
and J. Faaland, “Long-Term Trends in Europe’s Trade,” Economic Journal, March 1952; H. Neisser and F. Modi- 
gliani, National Income and International Trade, Univ. of Illinois Préss, 1953; I. Svennilson, Growth and Stagnation 
in the European Economy, Geneva, United Nations, 1954; A. K. Cairncross, “World Trade in Manufactures Since 
1900,” E. ia Inter ionale, Vol. VIII, No. 4, Nov. 1955; C. P. Kindleberger, The Terms of Trade, A European 
Case Study, Wiley, 1956; R. E. Baldwin, “The Commodity Composition of Trade; Selected Industrial Countries, 
1900-1954,” The Review of Economics and Statistics, Supplement: Feb. 1958; S. Spiegelglas, “World Exports of 
Manufactures, 1956 Vs. 1937,” Manchester School, May 1959. 

20 One should be extremely cautious not to view the exchange between the industrial countries solely as an 
exchange of manufactures against manufactures, nor, for that matter, the exchange between the industrial and 
non-industrial areas as exchange of manufactures against primary products. 
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modity composition of those exports was the rise in the value of shipments of 
petroleum and petroleum products, tropical foodstuffs, and various minerals. 

In addition, one must stress the increase in exports of both primary products 
and manufactures within the industrial area which points, no doubt, to the fact 
that the industrial countries continue to be good, if not better, customers of 
each other’s products.” 


(4) The imports of primary products represented, in 1938, 61 per cent of total 
import value of the industrial and 57 per cent of the non-industrial area. In 
1952-5, these percentages stood at 64 and 34, respectively. This demonstrates 
that the imports of primary products rose only insignificantly in importance 
for the industrial area, but that the imports of manufactures rose greatly in 
importance for the non-industrial area. Whereas the former tendency must be 
attributed primarily” to the intensified use of substitutable materials and in- 
creased production of domestic raw materials in the industrial area as well as 
to the sharp rise in the prices of primary products, the latter must be related 
to the developmental effort of the non-industrial countries. 


(5) The industrial area took 63 per cent of all imports of primary products in 
1938 and 75 per cent in the period 1952-5, but a decreasing percentage of world 
imports of manufactures (45% in 1952-5 as against 61% in 1938). The im- 
ports of the non-industrial area changed correspondingly, those of primary 
products moving down and those of manufactures moving up. 

These developments are different facets of the same tendencies mentioned 
under (3). However, what makes them more interesting when viewed from the 
import point of view is the realization (a) that, taken jointly, all the technologi- 
cal and structural factors, such as the development of synthetic rubber and 
fibers, the increased use of substitutes (for example, aluminum), and agricul- 
tural tariffs, did not affect the total value of imports of primary products of the 
industrial area as one might have expected,” and (b) that neither the industrial 
development nor the high tariffs in the non-industrial area contributed to any 
appreciable extent to a lowering of the value of manufactures imported by the 
countries in this area. 

It may be important again to introduce considerations of volume and spe- 
cifically establish the percentage changes, between 1938 and 1952-5, in: (a) the 
total volume of all manufactured commodities exported; (b) the total volume 
of primary products exported, and (c) the volume of manufactures entering the 
non-industrial from the industrial area. 


21 A. K. Cairncross (op. cit., p. 726) found this tendency to have started as far back as the beginning of this 
century and to be especially true for manufactures. R. E. Baldwin (op. cit., p. 59) expressed some doubts, however, 
regarding Cairncross’ findings. 

% Agricultural protection and the drastic fall in the requirements for raw materials and fuel per unit of manu- 
facturing output in the industrial countries probably played a contributing role. It should be mentioned that the 
annual G.A.T.T. studies have devoted much attention to this particular topic. To clarify the issues involved, the 
non-industrial area was split into two groups: so-called “semi-industrialized” countries (which numbered 8) and 
all the others and this led to some very informative observations and results. 

2 But as the panel of experts points out in the recent G.A.T.T. study, there have been some counteracting 
developments, such as increases in the imports of bauxite, uranium, and petroleum, which at least partially ex- 
plain the increased value of primary products entering the industrial from the non-industrial area (Trends in Inter- 
national Trade, Oct. 1958, p. 6). 


4 
: 
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A simple computation™ yields the following results, representing deflated 
1952-5 values expressed in 1938 prices: 


World exports of manufactures approximately $16.5 billions 
World exports of primary products approximately $13.5 billions 
Exports of manufactures from the industrial to the 

non-industrial area approximately $ 8.0 billions 


A comparison with the 1938 figures given in Table 4 reveals: (a) that the 
volume of manufactures almost doubled, while the volume of primary products 
hardly changed at all, and (b) that the volume of industrial countries’ manu- 
factures entering the non-industrial area rose by about 135 per cent. 


3. MAIN FINDINGS AND THEIR RELEVANCE FOR THE 
INDUSTRIALIZATION CONTROVERSY 


It is desirable to reiterate those main findings of the preceding investigation 
which have a particularly strong bearing on the industrialization issue. They 
are: 


(1) There was a definite decline in the proportion of world trade taking place 
between the industrial and non-industrial areas, which can be demonstrated 
not only on the basis of value, but also on the basis of volume of trade. The main 
reason for this tendency was the relative decline (from 29.6% in 1938 to 20.4% 
in 1956) in export volume from the non-industrial to the industrial area. At the 
same time there was a sharp rise in export volume from the industrial to the 


non-industrial area and in trade between the thirteen countries forming the in- 
dustrial area. 


(2) There was a substantial increase in the total volume of world trade and 
particularly in the proportion held by manufactured goods in that total. The 
rising share of capital goods and the declining share of manufactured consumer 
goods deserve special mention in this connection.2® The main findings of this 
study tend to support the position taken by the optimists in their controversy 
with the pessimists. While it must be admitted that world industrial production 
rose, between 1938 and 1956, actually faster than the volume of world exports 
(by approximately 120% as against 70%), none of the other fears of the pessi- 
mists seem to have materialized in the postwar period. 

Thus, as far as the total volume of world trade is concerned, the statistical 


™ The indices used here were: 214 for manufactures and 285 for primary products. They were derived by splicing 
the indices given in the United Nations’ Yearbook of International Trade Statistics 1956 (Vol. II, Table D, p. 25) 
with the series given in G.A.T.T.'s International Trade 1952 (Table I, p. 110). As C. P. Kindleberger has shown 
there is no unique set of these indices available (See his “The Terms of Trade and Economic Development,” The 
Review of Economics and Statistics, Vol. XL, No. 1, Part 2, Supplement: Feb. 1958, Table 1, p. 74). 

% The corresponding figures (expressed in billions of dollars) were as follows: 

Current Value Deflated Value 
1938 1956 (1938 = 100) 


Exports from the non-industrial to the industrial area 6. 
Exports from the industrial to the non-industrial area 6. 
Trade within the industrial area 6. 
Source: Tables 1 and 3 
* For a fuller discussion of this issue, see S. Spiegelglas, op. cit. 


5 12.2 
9 14.6 


298 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


data indicate that that volume has recently risen to unprecedented heights. This 
rise was due essentially to the increase in the volume of exports of the industrial 
countries. And what is quite important—not only did the industrial countries 
find good customers in other industrial countries’ markets, but the volume of 
their sales of manufactured goods in the non-industrial areas of the world 
actually more than doubled since the prewar years. The optimists appear also 
to have been quite right in maintaining that widespread industrialization 
results in a diversification of the demand for the products of the economically 
more advanced countries and particularly in a growing demand for their capital 
goods. This was clearly the case in recent times. 

On the other hand, there may be much validity in the argument which could 
be advanced by the pessimists to the effect that the substantial rise in the 
volume of manufactures exported to the non-industrial areas is to be mainly 
attributed to the favorable terms of trade of the non-industrial countries and to 
the high level of inter-governmental assistance in the period under study, and 
that these were only temporary occurrences obscuring the permanent ten- 
dencies. 

Since there is no way of determining whether the favorable terms of trade 
of the non-industrial countries and the high level of foreign aid and of credits 
were indeed merely temporary,?’ and especially to what extent they have ac- 
counted for the sharp rise in imports of the non-industrial countries, the resolu- 
lution of the controversy will have to await more convincing evidence which 
may develop in the future. 


27 Actually the difficulty lies primarily with the terms of trade, since foreign aid and credits are likely to be 
forthcoming if for no other reason than that East-West antagonism cannot be expected to disappear overnight. 
It seems appropriate to add that the recent findings of M. K. Atallah suggest that the terms of trade of the non- 
industrial countries will deteriorate in the future. (The Long-Term Movement of the Terms of Trade between Agri- 
cultural and Industrial Products, Netherlands Economic Institute, Rotterdam, 1958). 
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OPTIMAL PROPERTIES OF EXPONENTIALLY 
WEIGHTED FORECASTS* 


Joun F, Mutu 
Carnegie Institute of Technology 


The exponentially weighted average can be interpreted as the ex- 
pected value of a time series made up of two kinds of random com- 
ponents: one lasting a single time period (transitory) and the other 
lasting through all subsequent periods (permanent). Such a time series 
may, therefore, be regarded as a random walk with “noise” super- 
imposed. It is also shown that, for this series, the best forecast for the 
time period immediately ahead is the best forecast for any future time 
period, because both give estimates of the permanent component. The 
estimate of the permanent component is imperfect, and so the estimate 
of a regression coefficient is inconsistent in a relation involving the 
permanent (e.g. consumption as a function of permanent income). 
Its bias is small, however. 


1, INTRODUCTION 


ORECcASTS derived by weighting past observations exponentially (that is, 

geometrically) have been used with some success in operations research and 
economics. Magee [8], Winters [12],! and Brown [1] have used this approach 
in short-term forecasting of sales, primarily in inventory control. Distributed 
lags, while not always arising from explicit forecasts, have appeared in studies 
of capacity adjustment by Koyck [6], demand for cash balances during hyper- 
inflations by Cagan [2], the consumption function by Friedman [3], and agri- 
cultural supply functions by Nerlove [11]. Its main a priori justification as a 
forecasting relation has been that it leads to correction of persistent errors, 
without responding very much to random disturbances. 

Because exponentially weighted forecasts have been successful in a variety 
of applications, it is worth-while finding the statistical properties of time 
series for which the forecasting method would work well. The answer would 
allow the range of applicability of the forecasting method to be judged better 
and would ultimately lead to modifications when the conditions on the time 
series are not met. The methods we will use are related to those which have 
been moderately successful in control engineering.? 

We shall consider only one estimation problem—the possible inconsistency 
in the regression coefficient of the forecasts because errors of measurement may 
be present. The reader is referred to Klein [5] and Koyck [6] for a discussion 
of other estimation problems. 


2. “ADAPTIVE EXPECTATIONS” AND OPTIMAL PREDICTION 


The exponentially weighted moving average forecast arises from the follow- 
ing model of expectations adapting to changing conditions. Let y, represent 


* This paper was written as part of the project, “The Planning and Control of Industrial Operations” under 
grant from the Office of Naval Research. 

1 See also Holt et al. [4], Ch. 14. 

2 Statistical problems in engineering control are ably discussed in Laning and Battin [7]. 
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that part of a time-series which cannot be explained by trend, seasonal, or any 
other systematic factors; and let yf represent the forecast, or expectation, of 
yz on the basis of information available through the (t—1)st period. It is 
assumed that the forecast is changed from one period to the next by an amount 
proportional to the latest observed error. That is: 


(2.1) 


The solution of the above difference equation gives the formula for the ex- 
ponentially weighted forecast: 


yee 


Since the weights attached to prior vaules of y,; add up to unity, the forecasting 
scheme does not in this respect introduce any systematic bias. 

Now, suppose that the realizations of the random process can be written as 
some linear function of independent random shocks: 


> Wi€1-1, (2.3) 
t=1 
the shocks being independently distributed with mean zero and variance o”. 
As long as the parameters w,;, which characterize the random process, are 
known, it is a relatively simple matter to find the expected value of y, for any 
time period on the basis of previous outcomes. Suppose we wish to find the 
expectation of y; given €:-1, €:-2, €:-2, (that is, the only information lacking 
to give the value of y, exactly is the latest “shock,” e,). Then we simply replace 
«, by its expected value, which is zero. Therefore we have: 


> 


(2.4) 


In order to relate the regression functions above to the expression for the 
exponentially weighted forecast, we need to be able to write (2.4) in terms of 
the observed variables y:1, ys-2, . That is, we need to find the coefficients 
of the following function: 


(2.5) 
j=l 
Substituting from (2.3), and re-arranging terms, we obtain: 


= + +> ras) 


j=l 


5 
9 
(2.2) 
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er 
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By comparing coefficients of Equations (2.4) and (2.6), we have the necessary 
relation between the parameters, w;, associated with the latent shocks and 
those, vj, associated with the observable past history of the process: 


Wi, = v1 


2.7 
1=2,3,4,---. 
j=l 
In order to characterize the time-series for which the exponentially-weighted 
forecast is optimal, we substitute the weights 


vj = BIL — B)F", j=1,2,3,--- (2.8) 


from Equation (2.2) into Equations (2.7). The result of the substitutions is the 
system: 


w,=8 


i-1 
B(1 + > (1 = B)*"w,-;, i= 2, 3, 4, (2.9) 


j=l 
It follows that 
w;=8 for > 1. (2.10) 


Writing y; in terms of the independent shocks, as in Equation (2.3), we ob- 
tain: 


BD, er. (2.11) 
t=1 

The shock associated with each time period has a weight of unity; its weight 
in successive time periods, however, is constant and somewhere between zero 
and one. Part of the shock in any period therefore, has a permanent effect, 
while the rest affects the system only in the current period. 

We have assumed that the forecasts given by Equation (2.2) were for the 
period immediately ahead. The best forecasts for all future periods, however, 
are the same. In order to prove this assertion, it is necessary to generalize the 
previous results. Let y;,r represent the value of y.,r, forecasted on the basis 
of information available through period ¢ (T is called the “forecast span”). 
By an argument similar to that leading to Equation (2.4), we have the rela- 
tion: 


Wit TEti- 


t=0 


(2.12) 


However, we wish to express the forecast in terms of the observables y;, y:-1, 
Yt-2, , rather than the latent variables €:-1, . The desired equa- 
tion has the form: 


* Some ways in which the economic system might generate time series of this form is examined in [10]. 
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= UT (2.13) 
j=0 
From a knowledge of the weights w, we need to find the coefficients v. 
The appropriate relations, which are found in the same way as Equations 
(2.7), are the following: 
¢=0,1,2,--- 
Wet Ur, jWi-j, 2.14 
To find the coefficients of Equations (2.13), substitute from (2.10) into 
(2.14) to obtain the following system: 


B=vrit+B Dvr; t1=0,1,2,3---. (2.15) 
j=0 


Subtracting the conditions for i=k—1 from those for i=k (k=1, 2, 3,---), 
we obtain the difference equation: 


Urn = (1 — Borer, k=1,2,3,---. (2.16) 
Since, from Equation (2.15), vr,.=8, we obtain the unique solution: 
= = B)*, k= 0, 1, 2, (2.17) 


The forecast weights are therefore independent of 7, and the assertion is 
proved. This result is due to the fact that all prior shocks have the same weight, 
and that the forecasts only give an estimate of the permanent component of 
the shocks. 


3. EXPECTATIONS WITH INDEPENDENT COMPONENTS 


The same type of forecasting rule is appropriate if the permanent and 
transitory components in each period are statistically independent rather than 
perfectly correlated as in the preceding case. Let §, represent the permanent 
component of y, and 7, the transitory component, so that: 


Ye = De + me. (3.1) 


The transitory components are assumed to be independently distributed with 
mean zero and variance o. The permanent components are defined by the 
relationships: 


t 
€& = Dd 


t=1 


where the e’s are serially independent with mean zero and variance o%. The 
n’s and e’s are also assumed to be independent (this assumption is not, how- 
ever, an essential one). 

The forecasting problem, then, is to find the coefficients 1, m, +--+ in the 
equation: 


‘ 
{ 
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j=1 


which minimize the error variance, 


V=Ey-y))- (3.4) 


The methods used in Section 2 are not appropriate for the independent com- 
ponents because it is impossible to measure both the e, and the 7; from the one 
set of observed variables. We can, however, proceed as follows. Substitute from 
Equations (3.1)—(3.3) and write the forecast error in terms of the weights at- 
tached to past observations: 


j=1 t=1 j=1 


Setting the derivatives of V with respect to v, equal to zero, we have the 
following conditions on the optimal weights: 


= j 2 
(1- + = 0, k=1,2,---. (8.6) 


Ov; jak i=1 


Taking second differences of the conditions above to eliminate the long sum- 
mations, the optimal weights are the solution of the following difference equa- 
tions: 


(1+ =v. = — (3.7a) 


9 
2 


— = 0, (3.7b) 


The characteristic equation of the system above is 


_ 


(3.8) 


Because of the symmetry of Equation (3.7b), the characteristic roots occur in 
a reciprocal pair. Only the root less than unity, say 4, is relevant, because the 
infinite sums in Equations (3.5) and (3.6) would otherwise diverge. The rele- 
vant root may be written explicitly in terms of the variance ratio as follows: 


« gt 


(3.9) 


The solution will then be of the form: »,=cdj, where \, is the relevant char- 
acteristic root and c is a constant to be determined from Equation (3.7a). 


303 
(3.3) 
| 


304 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


Substituting from Equation (3.8) into (3.7b), we find that c=(1—)1)/M. The 
weights appearing in the forecasting formula would therefore be the following: 


(3.10) 


Note that the weights have the same form as those in Equation (2.2) if 8=1—1. 
The forecast for the next period is the forecast for all future periods as well 
because it is an estimate of the permanent component, 4;. 

If the changes in the permanent component are small relative to the “noise,” 
then \, will be very nearly unity. The forecast then gives nearly equal weights 
to all past observations in order that the transitory components tend to cancel 
each other out. The forecasts then do not depend very much on recent infor- 
mation because it says very little about the future. On the other hand, if 
changes in the permanent component are large relative to the noise, \; would 
be small so as to weight the recent information heavily. 

It is, incidentally, not necessary to assume that ¢«, and ; are uncorrelated. 
If Eem=o. and Len,=0 (t¥s), it is only necessary to replace the ratio 
in Equations (3.7)—(3.9) by 


4, PERMANENT AND TRANSITORY COMPONENTS OF INCOME 


In the preceding sections we have characterized the time series for which 
the exponentially weighted moving average equals the conditional expected 
value. The important feature is that the time series consists of two random 
parts: one lasting a single time period, and the other lasting through all sub- 
sequent periods. The structure therefore resembles the hypothesis concerning 
permanent and transistory components of income, advanced by Friedman in 
his study of the consumption function [3]. The structure is not exactly the 
same, however, because Friedman intentionally left the definitions of “perma- 
nent” and “transitory” somewhat vague. We have had to be more specific in 
the definitions, according to Equations (3.1) and (3.2), although nothing in the 
analysis states how long a “period” must be. Nevertheless, it appears that the 
exponentially weighted moving average is an appropriate measure of pemanent 
income if Friedman’s other hypotheses are true. 

A problem which may be raised at this point concerns errors of estimating the 
marginal propensity to consume because of inaccurate measurements of the 
permanent component. Suppose that consumption depends on the true value of 

_the permanent component of income, on the assumption that households can 
identify the two sources much better than is possible from aggregate time series. 
The propensity to consume estimated from the exponentially weighted moving 
average of income would then be biased downward because of the errors of 
measurement. We will show, however, that the bias should be small. The con- 
sumption model is: 


Cy = ay; + 6, (4.1) 


where c, is the consumption in the ‘th period, a the propensity to consume, 
and 6, the error term of the equation assumed to be independent of the perma- 
nent component and its estimate. 
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The least-squares estimate of a (denoted by &) which results from using the 
value of the moving average, y{4,, instead of the true permanent component, 
is: 


& = (4.2) 
Its probability limit is: 
P lim & = Cov(5u, yers)/Var yty1, (4.3) 


as long as 6; is not correlated with yj. 

Assume that the process applies to the income of each household, with ¢ 
denoting the age of the head of the household starting from the beginning of 
his working life.‘ 


y (4.4) 


t=1 


From Equations (3.3) and (3.10), yf; may similarly be expressed in terms of 
the independently-distributed shocks: 


Yur = (1 — Ax) Ai 


k=l 


(4.5) 
k=1 
The variance of y/,, is therefore: 
t 2t 
1 — As 
(4.6) 


hy 


the latter equation arising from the dependence of \; on the ratio 02/0? ac- 
cording to (3.8). The covariance between #, and yf,, may be similarly ex- 
pressed as: 


Cov(5:, = [ Jot (4.7) 


The asymptotic bias, according to Equations (4.3), is determined by the 
ratio of (4.7) to (4.6). Since this ratio is very nearly unity for moderately large 
values of ¢t, it appears that this kind of measurement error of the permanent 
component would introduce very little asymptotic bias in the estimate of the 
propensity to consume. 


4 This uses one of the ideas of Modigliani and Brumberg in their approach to the theory of the consumption [9]. 
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LARGE-SAMPLE COVARIANCE ANALYSIS WHEN THE 
CONTROL VARIABLE IS FALLIBLE* 


Freperic M. Lorp 
Educational Testing Service 


When the control variable contains errors of measurement, the usual 
analysis of covariance fails to adjust adequately for initial differences 
between groups. A large-sample significance test is presented here for 
the case where the fallible control variable has been measured in 
duplicate. 


HE control variable in analysis of covariance may serve either or both of 

two functions: (7) to reduce the error variance, and hence to increase the 
sensitivity with which difference(s) between two or more groups on a criterion 
variable are measured (and, consequently, to increase the sensitivity of statisti- 
cal tests for the significance of these differences); (77) to compensate, in part, 
for fluctuations in the criterion variable due to otherwise uncontrolled differences 
among the groups. Certain dangers inherent in the use of analysis of covariance 
for this second function are summarized by Cochran [1, sect. 2.2] and by Fair- 
field Smith [5, sect. 3]. This use is nevertheless widespread, particularly in 
educational and social science research, where relevant variables frequently can 


Fie. 1 


neither be controlled experimentally nor randomized. The present paper is pri- 
marily concerned with the second function, in the special case when the control 
variable is subject to errors of measurement. 

The situation in an ordinary analysis of covariance may be illustrated as in 


* The writer is much indebted to Prof. John W. Tukey for his valuable suggestions on two drafts of this paper. 
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Fig. 1. The two ellipses represent the scatterplots of groups A and B for the 
control variable z and the criterion variable y. It is assumed for purposes of the 
illustration only that z and y have a normal bivariate distribution in each group, 
and that the regression line of y on z has the same slope in each group. If all the 
frequencies are projected in a direction parallel to these regression lines on to 
the y-axis, the distributions of the projected frequencies will be normal, as 
illustrated by the two normal curves in the diagram. The projected values, or 
“adjusted y-values,” as they are often called, provide a means for comparing 


y 


Fia. 2 


the two groups with the linear effect of variable x removed. The simplest type 
of analysis of covariance for two groups is equivalent to making an ordinary 
t-test on the adjusted y-values to determine whether or not the means of the 
adjusted y-values are significantly different. For the sake of clarity, Fig. 1 has 
been drawn so that the normal curves do not overlap; hence the figure repre- 
sents a situation where the difference between the groups on variable y cannot 
be readily accounted for by differences in variable z. 

Fig. 2 is presented in order to illustrate the effect of errors of measurement 
in the control variable. It will be illuminating to consider first of all an extreme 
case where x and y are perfectly correlated. All observed cases in groups A 
and B are assumed to fall on the line segments AA and BB, respectively. In the 
special case illustrated, these two line segments are collinear. The adjusted 


A a 
¥ 
he / 
a / 
/ @ 
x 
— 
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y-values are obtained by projecting all cases in a direction parallel to these 
lines, as shown by the large arrow. All the projected cases fall on the single 
point P; hence all adjusted y-values are identical. The conclusion is that dif- 
ferences between groups A and B in variable y are completely accounted for 
by differences in variable z. 

Suppose now that sizable errors of measurement are introduced into variable 
x, but not into variable y. The effect on the scatterplots is to spread the z-values 
out horizontally while leaving the y-values unchanged, as indicated by the 
small arrows in Fig. 2. If the errors of measurement are uncorrelated with the 
true values and if all variables are normally distributed, then the resulting 
scatterplots are represented graphically by the elliptical outlines of the small 
arrows. These scatterplots are identical in all respects with those in Fig. 1. 
Whereas the regressions of y on the true values of z are the collinear lines AA 
and BB, the regressions of y on the fallible values of x are the separate lines 
Ga and bb. [The exact locations of the lines da and 65 are readily determined 
graphically because of the fact that the regression of y on x always passes 
through the points on the ellipse at which the tangents to the ellipse are 
vertical. | 

When the frequencies are projected in a direction parallel to aa or bb, the 
same frequency distributions will be found for the adjusted y-values as in 
Fig. 1. Hence, in this case, the usual covariance analysis, which ignores the 
fallibility of x, will reach the erroneous conclusion that the difference between 
groups A and B on variable y cannot be accounted for by the difference on 
variable z. It may also happen (as will be illustrated later by a numerical 
example) that the usual covariance analysis will fail to detect a statistically 
significant difference between groups A and B on variable y when such a dif- 
ference actually exists and can be detected by proper statistical procedures. 
Attention has previously been called to this inadequacy in [8], and presumably 
in other publications unknown to the present writer. Effective methods for 
dealing with the corresponding problem as it appears in partial-correlational 
analysis have been formulated by Stouffer [6], [7]. 

The purpose of the present article is to outline an appropriate procedure for 
dealing with analysis-of-covariance data in the simple case where there are two 
groups, composed of individuals on each of whom we have one measurement 
on the criterion va ‘iable and two duplicate measurements on a control variable. 
(The criterion variable may be fallibly or infallibly measured—no adjustment 
is required in either case, any more than it would be if there were no control 
variable, in which case a simple t-test would be made, regardless.) 

The procedure to be developed here is a large-sample procedure, lacking the 
elegance of the analysis of covariance. Similar procedures could be worked out 
for the case where the variance of the errors affecting the control variable is 
known; also for the case where there are more than two replicate measures on 
the control variable for eavh individual. It is to be hoped that improved, small- 
sample procedures will eventually be developed for dealing with all these prob- 
lems, and that the results will be extended to cover more general cases where 
there are many groups, with cross-classification, and many replicate measure- 
ments on each of several predictor variables. 


: 


310 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


1. AN APPROACH TO THE PROBLEM 


Consider a case where duplicate, fallible measures have been obtained for 
each of N individuals on the control or predictor variable, so that the size of 
the errors of measurement can be estimated. The problem will involve the fol- 
lowing six mathematical variables: 


y, the “criterion” or dependent variable; 

, the infallible “control” or independent variable, not directly observable; 
xz and X, duplicate fallibly measured values of &; 

e=x—£, and E=X —&, the errors of measurement for z and X, not directly 
observable. 


The present paper will be concerned with two separate populations, which will 
be distinguished by the use of a single and a double prime. Wherever it is rea- 
sonable to do so, symbols without primes are to be interpreted as referring to 
either population indifferently. The sample data will consist of N’ observations 
from one population and N” from the other. 

If — were actually observed, the standard analysis of covariance would 
assume the values f, and fo, +++, to be fixed values 
not subject to sampling fluctuation. For any value, §=£,, the distribution of 
y would be assumed to be normal, with a variance, o7.;, that is the same for all 
€ in both groups, and a mean, w,,, that is a linear function of &. Ho, the null 
hypothesis to be tested (the hypothesis that the two regressions are collinear), 
could be written 


by’ — — BR” — =0 (1) 


where £ is the regression of y on é for either group, ~ is the mean of the N’ 
values of &, uw; is the mean of the N’ corresponding values of yy,, etc. 

In the present problem, the same assumptions will be made and the same 
Hy will be tested. Since the values of £ are not observable, however, further 
assumptions are required before the necessary parameters can be estimated 
from the sample measurements available. It will be assumed that when §=&, 
the chance variables e and E are normally distributed, independently of each 
other and of y, with a mean (u,,) of zero and a variance (.2) that is the same 
for all values of & and for both populations. 

The significance test for Hy to be outlined here is a large-sample procedure, 
developed along lines several of which were suggested by Professor John 
Tukey. The basic step will be to obtain a sample estimator of the left side of 
(1) that is asymptotically unbiased. Since the sample estimator will be a 
continuous function of moments with continuous first derivatives, it will be 
approximately normally distributed in large samples. The large-sample stand- 
ard error of this estimator will be derived. If a 5-per cent (1-per cent) two-tailed 
significance level is used, H» will be rejected whenever the unbiased estimator 
differs from zero by more than 1.96 (2.58) times its standard error. 

The reader will have recognized that we are dealing here with a problem 
in linear structural relations. For either group considered alone, the structure 


a 
A 
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escapes unidentifiability because of the duplicate measure on the control 
variable. This duplicate measure (X) serves as an “instrumental variable”; 
more obviously, the duplicate measurements provide a good estimate of the 
variance of the errors of measurement, and this is all that is needed to make 
the structure clearly identifiable. The bare fact of identifiability is proved 
here by equations (14) through (20), which uniquely express all the population 
parameters for the unobservable variables in terms of those for the observable 
variables. For the relation of the present problem to well-known works on 
linear structural relation, the reader is referred to section 4 of Madansky’s 
thorough survey and summary [4]. 


2. COMPUTATIONAL FORMULAS AND NUMERICAL EXAMPLE 
The unbiased sample estimator to be used for the left side of (1) is 


U = — 9) — +X" 2) 


where # is the sample mean of the dependent variable, and X are the means 

of duplicate (fallible) measures of the independent variable, and §? is an esti- 

mate of the population regression coefficient 8, as defined below. The super- 

script 0 is used, when necessary, to indicate that the statistic to which it at- 

taches relates to both groups together rather than to either taken separately. 
Let 


= + 8xy)/2, (3) 
bes = (82 + 8x)/2, (4) 
N* =N’+N", (5) 
k = + (6) 
v = + (7) 


8:x)/N®, (8) 


where the s’s are observed variances and covariances. Then, 


w N°w? 


w= (N’ sex +N 


It is assumed here and throughout that the population reliability coefficient 
(p) and the corresponding sample value (r-x), 


(10) 


are neither zero nor indeterminate, since the present problem would have no 
practical significance for any data not meeting these assumptions. 

The large-sample standard error of U can be obtained from the following 
equations, summarized here for the reader’s convenience: 


(57) 


g = (N's, + )/N’, 


82x 
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Var = 2—) + g(k + |, (58) 
Nw? w? w 2 
1 1 vy? 
1 
+ + X" — — Var 8°. (60) 


The raw data for the numerical example are given in the first half of Table 1. 
The numerical values are abstracted from Table 9 of Adjustment to College 
by Frederiksen, N., and Schrader, W. B. (Educational Testing Service, 1951); 
except that all values involving X (to be distinguished from z) are hypothetical, 
since no duplicate measure of the predictor variable was available. The values 
in the second half of the table were computed from the raw data in the first 
half. The table, together with the equations to which it refers, provides a 
complete guide for practical computation. [The statistical results obtained 
here are not to be compared with Frederiksen and Schrader’s original conclu- 
sions, since they took into account an additional predictor variable not used 
in the present analysis. | 

The data in Table 1 were purposely selected from among various sets avail- 
able so as to illustrate the fact that the present method can demonstrate a 
significant deviation from the null hypothesis in data where no such deviation 


TABLE 1. COMPARISON OF AVERAGE GRADES EARNED BY MALE 
AND BY FEMALE COLLEGE FRESHMEN 


Male Female 


Number of cases (V°=N’+N’"’ =212) N’=119 N’’=93 


Freshman average grade (y): 


Mean 5’ =1.40 =1.57 

Standard deviation sy = .75 = 61 
Predictor variable (xz): 

= Mean #'=X'=4.07* | 2" = ¥=5.34" 

| Standard deviation sf =sx =2.30* |i! = ax’ =1.97* 
Correlations: rey =rxy = .65* fey = rxy= .46* 


rzx = .80000* .72738* 


Szy = cy =8x, =1.1212 | =.55278 
| 82x 82x = 4.2320 82x = 2.8229 
(eqs. 6 and 4) k= 4.6719 
=) (eqs. 7 and 3) v= .87185 
= (eq. 8) w= 3.6139 
ra (eq. 57) g= -47898 
(eq. 58) Var  .0004657 
Ay The test statistic, U (eq. 2) U=— .1358 
= | Standard error of U (eq. 60) S.E.v=J/Var U= :08053 
8 Ratio of U to its standard error U/S.E.uv = —1.69 
Significance level (one-tailed test) P= .05 


* All values involving X (as distinct from z) are hypothetical. 
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312 
q 
} 

: 


LARGE-SAMPLE COVARIANCE ANALYSIS 313 


would be shown by the usual analysis of covariance methods. The analysis 
of covariance for the actual data (ignoring the hypothesized variable, X) 
results in an F ratio of 0.67. This corresponds to a Student’s ¢ of t= +/F = —.821, 
which gives a one-tailed significance level of P =.20. This value of ¢ is virtually 
the same as the “critical ratio” U/\/ Var U =.826 that would be obtained by 
the present method (see eq. 61) if the predictor variable, x, contained no errors 
of measurement, so that r-x = 1.00. 

If the duplicate measurement X is used in the usual analysis of covariance 
as a second predictor variable, the standard error of estimate of the criterion 
variable is decreased, but by less than 5 per cent—from s,.2= 8,11 —r3, =.76s, 
to sy.2x = sy Thus the value of t=/F obtained by the 
usual analysis of covariance would still be less than .9. When the errors of 
measurement in xz are taken into account by the present method, however, 
the critical ratio (comparable to ¢) is found to be — 1.69, as shown at the bottom 
of Table 1. This value is at the one-tailed 5-per cent significance level. 

If these data were not in part hypothetical, and if there were some justifica- 
tion for the use of a one-tailed rather than a two-tailed test, one would be 
tempted to conclude that the boys did better in their college work than did the 
girls after allowance is made for the fact that the girls were initially better 
equipped intellectually than the boys. Actually, the data are presented for 
illustrative purposes only, and no broader interpretations should be drawn. 

Application of the present method may give either higher or lower signifi- 
cance levels than the conventional analysis of covariance methods, which 
ignore errors of measurement in the predictor variable. Since it is obviously 
unreasonable to match experimental and control groups on errors of measure- 
ment, the usual analysis of covariance methods are logically inappropriate in 
situations where substantial errors of measurement are present in the predictor 
variable. 


3, DERIVATIONS AND PROOFS 
Relations Between Observable and Unobservable Statistics 
The assumptions already mentioned about the errors of measurement may 
be written 
Mea = MEBs = Me = we = O, (11) 


= = Oe = OR, (12) 


= = Soaks = Tey = = Ook = Cee = = 0, (13) 


where each statistic refers to a single population of individuals. The symbols in 
(13) represent covariances. A symbol containing the subscript “a” pertains 
only to the case where § = £,; a symbol not containing the subscript “a” pertains 
to an entire population. For example, o.,y,=&(¢a—He,) (Ya—My,), Where &4 
signifies an expectation for some given value of a; but 


1 N 
Cy = Ea(€a — Me) (Ya — My): 
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Because y, e, and E are normally distributed, the joint frequency distribu- 
tion of all five chance variables is completely specified by its first- and second- 
order moments. The following paragraph shows that for either population all 
such moments involving unobservable variables can be expressed in terms of 
those involving observable variables only. 

By substituting «—e or X —E for & in (14)-(17) below and using equations 
(11) to (13), it is readily verified that 


g = ts = MX; (14) 


o(= Oz~e,X—E) = (15) 


Cty = Ory = (16) 
Ot: = = (17) 


By substituting x—£ for e and X —¢ for Z in (18)—(20) and using the foregoing 
equations, it is further verified that 


Ce = os 02x; = (18) 


ox = = 0, (19) 

The means of the three unobservable variables (£, e, and Z) are expressed in 
terms of the means of the three observable variables (y, x, and X) by (11) and 
(14). Equation (14) imposes one restriction on the three observable variables: 


Me = bX. (21) 


The variances of the unobservable variables are expressed in terms of the 
observable variables by (15) and (20). Taken with (12), these impose the 
further restriction that 


Oz = ox. (22) 


The twelve covariances involving unobservable variables are expressed in 
terms of observable variables by the thirteen equations given by (13), (16), 
(17), (18) and (19). Again a single restriction is imposed on the observable 
variables by these equations (specifically by (16)): 


Ory = Oxy. (23) 


Since all parameters for the population can now be expressed in terms of those 
of the observable variables, all can now be estimated from sample data, and 
any appropriate null hypothesis can be subjected to empirical statistical test. 

The next step is to use equations (11) to (23) in order to express Ho in terms 
of the observable variable z instead of the unobservable variable ¢. By a stand- 
ard formula for any regression coefficient, 8y;=o4,/0%. Hence, by (17) and (16), 


(24) 


q 
Ory OXy Ory OXy 
B= Bye = — = — = = 
2X 2X rX aX 
=| 
| 
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From (1) and (14), Hy may now be written in terms of observable variables: 
My’ — wy — — uz) = 0, (25) 


the value of 8 being obtained from (24). 

As already mentioned, these results are essentially the same as those that 
arise in the theory of structural regression when there are instrumental vari- 
ables. Equation (24) may be viewed as a specialization of a formula given by 
Tukey [9] and denoted in Madansky’s survey [4, sect. 4.1] as (II-V)/(I-IV). 


Sampling Variances and Covariances for the Unobservable Statistics 


Two useful relations derivable from basic general formulas and (14) to 
(24) are listed here for convenient reference: 


oy = On = Bor, (26) 
= + B ot. (27) 


It must be understood that all sampling variances, covariances, and ex- 
pected values to be derived in this paper are conditional moments of the sam- 
pling distribution for fixed values of ¢. Furthermore, these conditional moments 
are those of the asymptotically approximating sampling distribution for large 
N’ and N”’. Thus, for example, in spite of the fact that in finite samples neither 
the expected value nor the sampling variance of U exists, because of the theoret- 
ically possible occurrence of infinite values of the ratio 6°, it can neverthe- 
less be asserted that for fixed values of ¢ the test statistic U is asymptotically 
normally distributed (see [2, sects. 28.1-28.4]) with a mean of zero and a 
specified variance (Var U). The necessary asymptotic moments of the sam- 
pling distribution can be obtained by “propagation-of-error,” using the “delta” 
method with first, and sometimes second, derivatives [3]. In sufficiently large 
random samples, drawn from some population with fixed values of &, the ratio 
U/(Var U)*? will fall outside the limits +1.96, for example, approximately 
5 per cent of the time. The set of all possible sample points, regardless of £, 
for which —1.96< U/(Var U)*/?<1.96 thus provides an appropriate acceptance 
region for a large-sample significance test of H, at the 5-per cent level. 

Quite a number of asymptotic variances and covariances will need to be 
derived for the case where the values of ¢ do not fluctuate from sample to sam- 
ple; the desired formula for Var U will eventually be built up from these. 
To start out with, clearly, 


2 


1 
Var = (28) 


(29) 
ar a 


Cov (%, 9) = 0. (30) 


(Here and elsewhere either a prime or a double-prime may be added to each 
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statistic in the equations.) The sampling covariance of of with any other statistic 
is obviously zero; likewise, 
Cov (Ste, 8:z) = Cov (See, Sy) = Cov (Sez, Sey) (31) 
= Cov (Sez, = 0. 
The fact that the ¢’s are fixed does not affect the values of Var 8,2, Cov (sez, Sey), 
Cov (sey, Szy), and Var s,,, which according to the usual formulas are 


Var s eS. (32) 
ek 


1 
Cov (sez, Sy) = + eam) = 0, (33) 


Cov (sey 881) = (aay + = 0, (34) 


22 


1 
Var 8, = Gy. (35) 


Let &Q denote the expected value of any statistic, Q. Terms 0(1/N?) will 
generally be neglected; approximate equality will be indicated by the symbol 
=. Since &s;.=0, e and EZ are independent, and &(E,— E) =0, 


Cov (Ste, Sez) 


1 


1 
— — 2) (es — 28(Es — E). 
a 
= 0. 


It is similarly found that 


Cov (Sty, Sey) = Cov (Sey, Szy) = 0. 
Next, 


Var s;, = 
1 2 
1 
DX DX (& — — HE(ea — (es — 2) 
4 a 


1 22 


N 
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; 

(37) 
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(except for quantities 0(1/N?) that arise because s;. is computed from é rather 
than from 


Cov (Ste, Sey) = E8¢e8ey 


1 1 


1 
& — — ales — 2 (yo — 5) 
| 


1 
(a — £)8(€a — (es — — 9) on 


1 
x (Ea — E)E(ea — 2)°E(Ya — 3) 


1 ¢ 
— &) (ya — 5) 
1 


N 


(note that So that &(y.a— 5) #0). 
Var s:, may best be evaluated by a slightly different approach, setting &—£ 
==, and ¥.—j=Ya: 


1 
Var sy, = Var 


Sampling Variances and Covariances of Observable Statistics 
Except for (28) to (30), all the foregoing sampling variances and covariances 
are for statistics involving the unobservable variables e or £. They are needed 
as a step in finding the sampling variances of observable statistics. Thus, 
2 
= = + Ste + Sez + Sex. (41) 
It follows from the symmetry of e and E that 
Var = Var + 2 Var s;. + Var 
+ 4 Cov (se, + 2 Cov (se, 
+ 2 Cov 8x) + 4 Cov Sex). 


| 
1 
>, Var Y. (40) 
1 
= N2 Fy-t =a 
2 2 
= 
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The first variance and all the covariances in (42) vanish; hence, from (38) and 
(32), 
2 2 2 
Var s:x = + (43) 
Similarly, 
2 


1 
Cov (8zy, 82x) = (44) 


1 
Var tay + (45) 


Cov (82y, 8xy) = (46) 
N 
The Test Statistic as an Unbiased Estimator 


The next task is to prove that the test statistic, U of (2) is an unbiased 
estimator of the left side of (25) except for quantities 0(1/N). A first step is 
to find the asymptotic expected value of the ratio 


ree 


+ bey 


v 
w 


Set »=&v, V=v—v, w=Sw, W=w—w. It is seen with the help of (26) that 


[(N’ — log + (N” — 


(47) 


1 , ” 
[((N — lo, +(N 


v 
(48) 
The present problem is of no interest if #=0, so w¥0 will be assumed through- 
out. 
Now, 


identically. Following the “delta” method, expand each fraction on the right 
into a power series in W: 


Thus 
v V v 
+ 
w W+w W+eo 
4 W+o o w w? 
(49) 
rp. V 1 (v VW + ) 
W+o w w 
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Take the expected value of (49) to find that 
v v Varw Cov (v, w) 
1+ 


@ vw 


A rigorous justification of this procedure is given in [3]. 
From (44), since statistics from different groups are independent, 


(53) 


Now k—vw is an unbiased estimator of 02. It follows that 8° is an unbiased 
estimator of B=v/w. 

Since 6° is a function of second moments, and since y, e, and E are normally 
distributed for fixed £’s, 8° is independent of the other statistics in U, which are 
all first moments. Hence, finally, U is an unbiased estimator of the left side of 
(25), except for terms 0(1/N"). 


Standard Error of the Test Statistic 
From (7), (3), (45), and (46), it is found that 


Vv ae 2 1 2 
arv = Ve Wo 


y = (N’o, 
From (51), (52), and (55), by the delta method, 


w? 


w? 


v 1 2v 
Var — = Var w + Varv — — Cov x) | 


Var f° is the same as (56) up to quantities 0(1/N' ey, 
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(50) 
1 2/ N’ N” ” 
Cov (v, w) = Wo Ce Oxy + Wo os) 
1 
= Wo (51) 
Similarly, from (3) and (44), 
1 2 
Thus, 
2,2 
&—=—]1 
w w | * 
where 
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From (27) and (26), 


= oy — Boy =o, — Bory. 


Thus an adequate sample estimator of o7.¢ is g—v?/w where 
0 
g= (Ns, + Ns, )/N. (57) 
Since k—w is an unbiased estimator of o?, an adequate sample estimator of 
Var #° is 
1 k? k 1 
Var B° = E (= —2 ~) +—g(k+ u) |. (58) 
w? 2 


From (28) to (30), since 8° is capes of the other statistics in U, 
Var U + (<i + 0) 


A sample estimator for (59) is 


1 1 
V tex Ae k— 
ar U =z) + ( | 
1 
+ 4 +X” — # — X’)? Var 8°. (60) 


Equation (60) gives an estimate of Var U for use in actual calculations, the 
value of Var 6° being obtained from (58). 


4, THE SPECIAL CASE WHERE THERE ARE NO ERRORS OF MEASUREMENT 


As already pointed out, the usual analysis of covariance proceeds as if the 
predictor variable, z, were measured without error. This is a limiting case of the 
general situation where z contains an error of measurement. In the limiting 
case, =X and o?=0. Making these substitutions in (2) through (9) and in 
(58) and (60) yields a set of formulas representing a large-sample approxima- 
tion to the usual analysis of covariance. 

It is found for this limiting case that w=k and that 


N” N° 
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VARIANCE OF THE MEDIAN OF SAMPLES FROM 
A CAUCHY DISTRIBUTION 


Pau R. Riper 
Aeronautical Research Laboratories, Wright-Patterson Air Force Base 


Exact values of the variances of the medians of small samples from 
a Cauchy distribution are given. They are compared with the values 
obtained by using the formula for asymptotic variance. 


frequency distribution which is of considerable theoretical interest in sta- 
tistics is the Cauchy distribution, given by the formula 


f(z) = [1 + @ (1) 


It is well known that the distribution of means of samples from (1) is exactly 
the same as (1) itself and that consequently the mean is no better than a single 
observation as an estimator of the parameter @ [2, p. 2]. The variance of the 
mean is infinite. 

Maximum likelihood estimates of @ have a variance of 2/n in large samples 
of size n, while the variance of the medians of large samples is r?/4n [1, p. 708]. 
The latter does not provide a very good estimate of the variance of the medians 
of small samples, as will be shown. 

It is easily deduced that the frequency density function of the median of a 
sample of 2k+1 observations from (1) is given by 


1 


arctan? (x — + (2 — (2) 


(See Fisher [1, p. 715].) 
The variance of the median may be found by multiplying (2) by (x—6)? and 
integrating between the limits — «© and . It is convenient to make the sub- 
stitution ¢=arccot (c—9). The required variance will then be 


2(2k + 1)! 
(k!) 


f — cot? odd. (3) 
0 


The integral in (3) converges for k>2, otherwise it diverges. Values of (3) 
for k=2 (1) 15, that is for sample sizes 5, 7, 9, - - - , 31, were obtained by 
numerical integration. Acknowledgement, with thanks, is made to Dr. Gertrude 
Blanch for reducing (3) to a form which was convenient for computation and 
to Mr. Donald 8. Clemm for performing the actual computation. 

Values of the variance are shown in Table 1. For purposes of comparison, 
values of +?/[4 (2k+1)], the approximate variance in large samples, are also 
listed. 

The intrinsic accuracy, defined as the variance of the logarithmic derivative, 
with respect to the parameter, of the density function, is considered more 
relevant than the variance in small samples in both Fisherian and Bayesian 
theory of estimation. Values of the intrinsic accuracy are given in the final 
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VARIANCE OF CAUCHY MEDIAN 


TABLE 1. VARIANCES OF SAMPLE MEDIANS 


Sample size, Variance of 7 Intrinsic 
2k+1 median 4(2k +1) accuracy 


2.467400 - 50000 

.822467 -09064 
1.221253 -493480 - 74552 
-612082 -352486 -44042 
-408659 .274156 . 16164 
-306803 224309 -90109 
- 245622 - 189800 -65369 
204798 164493 -41608 
- 175618 .145141 - 18596 
- 153719 129863 -96171 
136678 .117495 .74214 
123040 107278 -52636 
-111877 -098696 -31370 
102572 -091385 - 10363 
-094696 -085083 .89574 
-087944 -079594 -68970 


CONOR 


column of the table. Grateful acknowledgement is made to Mrs. Martha Elmore, 
who, under the direction of Dr. Blanch, computed these values (with the ex- 
ception of the first six, which are to be found in [1, p. 716]). 
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TESTS OF THE HYPOTHESIS THAT A LINEAR REGRESSION 
SYSTEM OBEYS TWO SEPARATE REGIMES* 


Ricuarp E. QuaNDT 
Princeton University 


Several approaches are explored for testing the hypothesis that no 
switch has occurred in the true values of the parameters of a linear 
regression system. The distribution of the relevant likelihood ratio \ 
is analyzed on the basis of the empirical distribution resulting from 
some sampling experiments. The hypothesis that —2 log \ has the x? 
distribution with the appropriate degrees of freedom is rejected and an 
empirical table of percentage points is obtained. Finally some small 
sample tests are suggested. 


1, INTRODUCTION 


I A previous article! the author has suggested a method of estimating the 
parameters a), bi, 01, a2, be, o2, and ¢ of a system 


Ys = + + 
Yi = + bor; + i=t+1,---,T7 


where the u; are independently and normally distributed with mean zero and 
variances o; and o3 respectively, and where we are given 7 independent pairs 
of observations (2, y1) - - - (er, yr). The major problem was the estimation of 
the location of the unknown switching point t. Two ways in which one might 
test the hypothesis that no switch occurred against the single alternative that 
one switch took place were also suggested. The purpose of this paper is to 
examine several alternative tests. 


2. LIKELIHOOD RATIO TESTS 


The relevant likelihood ratio previously derived? is 


T-t 


where é; and é» are the standard errors of estimate of the left hand and right 
hand regressions respectively, ¢ is the standard error of estimate of the overall 
regression based on all observations, and where ¢t has been chosen so as to mini- 
mize \. It was conjectured that the distribution of —2 log \ may be approxi- 
mated for large 7 by the x? distribution with 4 degrees of freedom. 


* The calculations for the sampling experiment described in Section 2 were performed on an 1BM 650 com- 
puter through the courtesy of the Econometric Research Program at Princeton University. I am especially grateful 
to Dr. H. Karreman who spent countless hours of his time programming the problem and running it on the com- 
puter. My thanks are also due to my colleague, F. F. Stephan, to J. Pfanzag] of the University of Vienna and to 
J. Durbin of the London School of Economics for helpful criticism and advice. The responsibility for errors is, of 
course, mine. 

1 Quandt, R. E., “The Estimation of the Parameters of a Linear Regression System Obeying Two Separate 
Regimes,” Journal of the American Statistical Association, 53 (1958), 873-80. 
Ibid., p. 876. 
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This conjecture was examined with the aid of a sampling experiment de- 
scribed below. Under the null hypothesis the observations are generated by a 
single regime. The variable x was assumed to be nonstochastic with the particu- 
lar values .05, .10, .15, - - - , .95, 1.00. Two hundred z-series with 20 observa- 
tions each, 200 series with 40 observations each and 200 series with 60 obser- 
vations each were generated. Each of the z-values was used once for the series 
with T =20, twice for the series with T =40 and three times for the series with 
7 =60. Corresponding u; were generated from the normal distribution with 
zero mean and unit variance. Finally, the value of \ was calculated for each 
of the 600 series. 

It was not necessary to calculate the observed y; by assuming particular 
values of a and b in y;=a+bz;+u,;; nor was it necessary to perform the calcu- 
lations for several alternative values of o because the likelihood ratio is inde- 
pendent of a, b, and c. This can be proved as follows. Since the estimate for 
6] is given by 


t 


and since 


and d=a-— z+ i, 


substitution gives 6? = 6? —62,/6? for the left hand regime, and similarly for 6} 
and 6. Hence the error variances are independent of a and b. We further ob- 
serve that the likelihood ratio is homogeneous of degree zero in 61, 62, ¢. This 
completes the proof of the assertion. 

The x? Conjecture. The x? distribution with 4 degrees of freedom was fitted 
to each of the empirical distributions of —2 log \. The hypothesis that each of 
these distributions was generated by the x? distribution with 4 degrees of free- 
dom was tested by the x? criterion and was rejected in every case with a sig- 
nificance level of .00001.’ It is quite clear that the x? approximation fails com- 
pletely. The concrete reason for this is that in the case of the present empirical 
distributions the bulk of the observations is much farther from the origin than 
is the case for the x? distribution with 4 degrees of freedom. A possible theo- 
retical reason for this behavior is as follows: in order that —2 log \ have the 
x? distribution (and that maximum likelihood estimates be asymptotically 
normally distributed) one has to assume that the derivative of the likelihood 
function vanishes at the maximum. Since the likelihood function is defined only 
for integral values of the parameter ¢, this is a bad assumption for any value of 


* Mann, H. B., and A. Wald, “On the Choice of the Number of Class Intervals in the Application of the Chi 
Square Test,” Annals of Mathematical Statistics, XIII (1942), 306-17. The number of classes k is set equal to 
4(2(N —1)2/ct)/s where c is given by (1/V2x) fo exp (—2z*/2)dz =probability of Type I error. The coefficient c was 
chosen to be 1.96 and thus k =30. This differs from the correct figure (31) but was found to be much more con- 
venient. The resulting reduction in the power of the test is likely to be negligible. See Williams, C. A., “On the Choice 
of the Number and Width of Classes for the Chi-Square Test of Goodness of Fit,” Journal of the Amer ‘can Statistical 
Association, 45 (1950), 77-86, esp. 85-6. 
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7. The empirical observed distributions for —log \ (rather than —2 log \) are 
given in Table 1. 


TABLE 1. OBSERVED FREQUENCY DISTRIBUTIONS OF —LOG A 
FOR VARIOUS VALUES OF T 


Observed Frequencies for 


—log 
T =40 


2.0000- 3.9999 
4.0000— 5.9999 
6.0000—- 7.9999 
8.0000— 9.9999 
10 .0000—11 .9999 
12 .0000—13 .9999 
14.0000-15 .9999 
16 .0000-17 .9999 
18 .0000—19 .9999 
20 .0000— 


An Empirical Test. An empirical table of percentage points could be derived 
with high degree of reliability if the three empirical distributions could be 
pooled. This procedure could not be utilized, however, since the null hypothesis 
that the three observed distributions were generated by the same underlying 
theoretical distribution must be rejected. Positing the null hypothesis, the 
probabilities of greater discordance among the distributions according to vari- 
ous test criteria are listed in Table 2.‘ 


TABLE 2. PROBABILITIES OF WORSE FITS ACCORDING 
TO VARIOUS TEST CRITERIA 


Probability of Worse Fit 


x? contingency test -00741 
Kruskal-Wallis test .00013 
Kiefer test .00500 


The null hypothesis that no switch occurred can be tested with reference to 
the empirical table of percentage points. Table 3 lists for various values K the 
estimated probability that —log \< K and also 95 per cent confidence intervals 
around the estimates. The cumulative distributions indicate that an increase 
in T shifts the distribution to the right without, however, materially affecting 


See Kruskal, W. H. and W. A. Wallis, “Use of Ranks in One-Criterion Variance Analysis,” Journal of the 
American Statistical Association, 47 (1952), 583-621; Kiefer, J., “K-Sample Analogues of the Kolmogorov-Smirnov 
and Cramer-y. Mises Tests,” Annals of Mathematical Statistics, 30 (1959), 420-47. The results of these tests were 
completely supported by pairwise Kolmogorov-Smirnov and Wilcoxon-Mann-Whitney tests. 

5 The upper and lower bounds of the confidence interval were calculated by solving the equation 


(1 — p) 
200 


where f is the observed relative frequency. 


| 16 2 1 
29 19 23 
45 44 31 
36 52 39 
36 44 50 
20 17 27 
ae 7 9 9 
rae 7 6 10 
=. 1 3 6 
3 4 4 
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the right hand tail of the distribution: for values of K greater than 19.0 the 
absolute differences between pairs of cumulative relative frequencies is already 
very small. On the basis of this table the null hypothesis could not be rejected 
in the example analyzed in an earlier article.‘ 


TABLE 3. PERCENTAGE POINTS FOR —LOG A WITH 95% 
CONFIDENCE INTERVALS 


T =20 T =40 T =60 


Lower p ( 


Upper | Lower 5 Upper | Lower - Upper 
B(-log <K B(-log <K 


Bound | Bound Bound | Bound Bound 


(.008) -021 (.052) | (.000) (.019) | (.000) (.019) 
(.049) 080 (.127) | (.004) ° (.041) | (.000) . (.030) 
(.087) 126 (.179) | (.022) d (.081) | (.023) . (.081) 
(.175) 228 (.291) | (.073) . (.160) | (.083) . (.175) 
(.305) 370 (.439) | (.160) . (.273) | (.135) ° (.243) 
(.386) 454 (.524) | (.265) (.394) (.344) 
(.477) 547 (.615) | (.378) (.515) (.457) 
(.564) 633 (.697) | (.520) (.656) (. 542) 
(.692) 756 (.811) | (.637) (.763) (.663) 
(.751) 811 (.860) | (.744) : (.854) J ‘ (.778) 
(.822) -875 (.914) | (.788) (.889) (.854) 

912 

928 

967 

984 

985 

986 

987 

988 


SEE 


(.864) (.944) | (.841) (.928) (.848) 
(.893) (.957) | (.874) (.950) (.919) 
(.932) (.985) | (.894) (.964) (.937) 
(.955) (.995) | (.898) (.965) (.964) 
(.957) (.995) | (.929) (.983) (.976) 
(.958) (.996) | (.947) (.991) (.986) 
(.959) (.996) | (.954) (.994) (.994) 
(.961) (.997) | (.958) (.996) (.996) 


3. SMALL SAMPLE TESTS 
Test 1. Assume that a maximum likelihood estimate ¢* of t has been found. 
This value of ¢ divides the observations into a left hand and a right hand 
group for which separate regressions are obtained. Denote the estimated re- 
gressions by 
= + bz; (3-1) 
9s = Ge + box, ¢=f*+1,---,T. (3-2) 
Calculate the 9’s from the left hand regression using the x’s belonging to the 
right hand group and the y’s from the right hand regression corresponding to 
the z’s belonging to the left hand group. Form the residuals of the type 
(3-3) 
ro = 0, + bit; y; t=f+1,---,T (3-4) 
and obtain their means 
1= DX (G2 + — ys) (3-5) 
i=1 


T 


1 
(4, + bz; — (3-6) 


T —t i=t*41 


* Quandt, R. E., op. cit., 870-2. 
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Under the nuli hypothesis the terms summed in (3-5) are normally and inde- 
pendently distributed. Similarly, the terms summed in (3-6) are normally and 
independently distributed. Writing 


be + bet; — ys — 


(4, + biz; Yi — Fe)? 


2 
82 = 


T-#-1 


the quantities 


— E(f:))/s: and WT — t*(F, — (3-9) 


would appear to have the Student-t distribution. The null hypothesis will be 
rejected if either of the observed mean residuals is significantly different from 
zero. 

There are two difficulties with such an approach. First, the expected values 
of #, and 7 


tml 
t* 


= (a2 — a1) + (b2 — bi) (3-10) 


= (a1 a1) + (3-11) 


can equal zero not only under the null hypothesis (7.e., when a; =a and b; = be) 
but also when b, be, provided that 


T 
(az — a;)/(b2 — = — a/t*=— 2/(T (3-12) 


In other words, the expected value of the test statistic can equal zero under the 
alternative hypothesis as well. This would tend to diminish the power of the 
test. However, this is not likely to be a source of serious difficulty. In order 
that (3-12) should hold, the sample must lie in a space of dimension T-2 (since 
(3-12) represents two independent linear restrictions on the z’s). Unless the 
joint distribution of the z’s is singular’ such an event will occur with probability 
zero. 

A more serious objection is that the distribution of the test statistics will be 
affected by ¢* having been estimated from the data. This very probably makes 


7 Cramer, H., Mathematical Methods of Statistics (Princeton: Princeton University Press, 1946), p. 297. 


| 
& 
and 
t* 
> 
Lie 


TESTS FOR SEPARATE REGIMES 329 


the use of the Student-t distribution with ‘*—1 degrees of freedom (or with 
T —t*—1 degrees of freedom for the second test statistic) incorrect. 

Test 2. In order to avoid the difficulty mentioned in the previous paragraph, 
divide the observations into two groups of t and 7'—t observations where 
t= 7/2 if T is even and t=(T'—1)/2 (or (7+1)/2) if T is odd. Let the residuals 
be defined as in (3-4). Then the quantities 


(3-13) 


have the Student-t distribution with 7 —t—1 and t—1 degrees of freedom re- 
spectively. If the null hypothesis is false, it is plausible that the power of the test 
will be the greater the closer the switching point is to ¢, 7.e., the less each esti- 
mated regression is contaminated by data generated by the other regime. One 
factor seems to indicate that this test might generally be more powerful than 
tests in which the denominator of the test statistic is s; (or s2) rather than the 
standard error of estimate é; (or é:). One would generally expect the standard 
errors of estimate to be less than the standard deviations of the residuals if 
the null hypothesis is false. Therefore, for given values of #, and 7: the absolute 
values of the test statistics would tend to be larger under Test 2 than under 
Test 1, thus making it easier to reject the null hypothesis when it is false. Un- 
fortunately, the possibility of contamination which is the consequence of divid- 
ing the data into two groups arbitrarily has the opposite effect on the power 
of the test. The first factor will predominate if the true switching point is near 
the center of the time series, but one cannot say which factor will be more 
important if it is not. 
Test 3. Under the null hypothesis the quantities 


T 
(a2 + bor; — yi)?/o? and (a2 + box; — Y:)?/o? 


imt+1 


are independently distributed as x? with t—1 and T —t—1 degrees of freedom 
respectively. Their ratio therefore has the F-distribution with t—1 and T—t—1 
degrees of freedom. An analogous F-ratio can be calculated by utilizing the 
calculated left-hand regression. The null hypothesis will be rejected if either 
of the observed F-ratios is too large. This test avoids both of the difficulties 
connected with Test 1, but clearly is subject to the objection that the power 
of the test is related to the location of the true switching point. 


4. CONCLUSION 


It was attempted in the present paper to indicate various approaches to 
testing the hypothesis that no switch occurred in the parameters of a linear 
regression system against the alternative that one switch took place. A sampling 
experiment was performed and its results led to the rejection of the hypothesis 
that —2 log \ has the x? distribution. The sampling experiment indicates that 
the distribution of the logarithm of the relevant likelihood ratio is not inde- 
pendent of the length of the series. An empirical table of percentage points was 
derived from the data for T = 20, 40 and 60. 


VT oft — 
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Various small sample tests are suggested. Each of the tests depends upon 
dividing the observations into two groups. On the whole it seems preferable to 
make this division arbitrarily (in which caee the power of the test will tend to 
be reduced) than to use a maximum likelihood estimate for the switching point 
as the dividing line (in which case the distribution of the test statistic cannot be 
ascertained). A Student-t test also suffers from the relatively minor disadvan- 
tage that the expected value of the test statistic can equal zero under the alter- 
native hypothesis as well, albeit with probability zero. Finally an F-test is 
suggested for testing the null hypothesis.*® 
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A NOTE ON AVERAGE TAU AS A 
MEASURE OF CONCORDANCE 


L. Hays 
University of Michigan 


A computationally convenient way to find average tau is pointed 
out, and the possibility is raised of simple chi-square tests of significance 
for average 7 as well as Kendall’s coefficient of agreement u. A method 
for obtaining the rank order of “best fit” to a group of rankings in 
terms of average tau is also discussed. Finally, an “analysis of agree- 
ment” is proposed based upon a partition of average tau into agreement 
within and between experimental groups. 


1. INTRODUCTION 


HE well known “coefficient of concordance,” W, of Kendall [2, p. 95] is a 

measure of the average agreement among a set of rank orders of n objects 
given by m judges. It is a simple function of the average Spearman rank corre- 
lation (rho) among all 3(m)(m—1) distinct pairs of judges. There is, however, 
an alternative measure of concordance which parallels the Kendall W, except 
that it is a similar function of average Kendall tau among all pairs of judges. 
The use of average tau as an appropriate way to measure concordance was first 
pointed out by Ehrenberg [1], who showed that computationally, average tau 
is identical with the coefficient of agreement, u, described by Kendall [2] for 
paired-comparison data. Ehrenberg also investigated the distribution of aver- 
age tau, and proposed a chi-square approximation to the sampling distribution 
under the null hypothesis of minimal agreement among a population of 
rankers. 

The purpose of this note is to raise the possibility of a test for the significance 
of u and of average tau for samples of moderate size somewhat simpler than 
those given by Kendall and by Ehrenberg, and also to point out the possi- 
bility of an “analysis of agreement” for several groups of judges based upon a 
partition of the “total agreement” by groups. 


2. AVERAGE AGREEMENT WITHIN A SINGLE GROUP 


Consider a set of n objects which are to be rank-ordered by each of m 
judges. If we let 7 stand for a particular pair from among the (9) pairs of 
objects, we may define a;;=1 if judge j agrees with some arbitrary ranking of 
pair 7, and a;;= —1 if judge 7 disagrees with the arbitrary ranking of the pair 7. 
In these terms, Kendall’s tau for judges j and k might be written as 


2 


(a,j x) = (1) 


representing as it does the difference in proportion of object pairs for which 
there is agreement minus the proportion for which there is disagreement for 
the two judges. Now if we also define 


ay = a 


331 
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and sum a;—™m over all (3) pairs of objects, 
2 


n(n — 1)(m)(m — 1) es 


=7 (2) 


where 7 is the average Kendall tau between all }m(m—1) pairs of judges in 
ranking the n objects. This may be simplified by letting x; equal the number 
of judges who agree with the arbitrary ordering of the pair 7, and y; equal the 
number who disagree with the ordering, so that m=2;+y;, a;=2;—y;; with 
these substitutions average tau becomes 


2 (2x, — m)? 


n(n — 1)(m)(m—1) m—1 


= 7. (3) 


Parenthetically it will be noted, of course, that the arbitrary ranking as- 
signed to any pair of objects 7 is completely irrelevant to the end result. How- 
ever, should this criterion ranking to which the ordering for each judge is 
compared be meaningful, within the experiment, then it becomes possible to 
obtain another average tau as a kind of “bonus” from these data by noting that 


2a; 


1 = (4) 


m(n)(n — 1) m(n)(n — 1) 


where 7¢ is the average of the m taus between each judge and the criterion 
ranking. Or, if the initial ranking to which the judges are compared happens 
to be purely arbitrary, but it is desired to find the rank order of “best fit” to 
all of the judges in the sample, the 7¢ might be useful in this connection. 
Kendall recommends the estimation of a “true ranking” from a set of m rank- 
ings by taking the rank order of the sums of the ranks of the objects across 
judges. However, a perfectly good alternative would seem to be to estimate 
that rank order as the true one which maximizes 7c. This may be done quite 
simply by putting the rank-order data into a table showing the number of 
cases in which the row object is ranked above the column object (as in the 
example to follow) and permuting rows and columns (simultaneously, so that 
rows and columns show the same order) until the sum of the entries above the 
diagonal is maximal. The required rank-order may then be read directly from 
the order of the rows or columns in the table. This method seems as intuitively 
appealing as Kendall’s, especially if average tau rather than average rho is 
chosen as the measure of agreement among judges. It will be noted that 
Kendall’s method may, but does not always, maximize 7c. 

The average p calculated from Kendall’s W cannot be less than —1/m—1, 
while average 7 has as its minimum value —1/n—1 for m even, and —1/m for 
m odd. Just as pf may be turned into an index W ranging from 0 to 1, so 7 
can be turned into a measure of agreement W, also ranging from 0 to 1, thus: 


iy hi 
| 
rin 
d 


AVERAGE TAU AS A MEASURE OF CONCORDANCE 


(5) 
= m odd. 
m+1 


It follows from (5) that in an infinite population of rankers, W, and 7 are 
identical. 

As noted above, average 7 as given by expression 3 is computationally 
identical with the method for computing Kendall’s coefficient of agreement u 
for paired comparison data [2, p. 149]. However, the sampling distribution of 
u and average tau under the usual hypotheses of paired comparisons differ 
from their sampling distribution under hypotheses appropriate to ranking 
situations. 

Some features of average 7 in the limit are of interest in this connection: 
expression 3 above may be rewritten as 


1 1 


where p; is the proportion agreeing with the arbitrary rank order for pair 7, 
and q; is the proportion disagreeing with the order for that pair. For random 
sampling with replacement 


1 
E(7) = (bs — 4)? = Wi (7) 
where $;, 9;, and W, are population values. 


Furthermore, note that from expression 7, population 7=0 (or population 
a& =0) implies and is implied by 


Dd (bi — 9)? = 0. 


This can be true if and only if $;=9;=}3 for any pair of objects ranked or 
compared. 

This fact suggests the possibility of a simple large-sample test of the hypothe- 
sis Hy: a=0, for pair-comparison data. Kendall [2, p. 153] tests this hypothesis 
in terms of the statistic 


which is distributed approximately as chi-square with 


333 
W.= m even 
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degrees of freedom. Obviously, v is not usually integral-valued, which leads to 
a certain amount of inconvenience. However, for any sample 


2>5(2 


the form of the familiar Pearson chi-square for such frequencies with expecta- 
tion m/2. Moreover, as m—> , v->(). The statistic (10) has variance 


n\m—1 
2( ) 
2 m 
which approaches the variance of chi-square with (3) degrees of freedom. 
This leads one to the conjecture that with reasonably large n, the ordinary 


Pearson chi-square criterion with expectation equal to 3m and (5) degrees of 
freedom might give a reasonable fit to the null distribution given by Kendall. 


TABLE 1. PROBABILITIES THAT CERTAIN VALUES OF u WILL BE 
ATTAINED OR EXCEEDED WHEN POPULATION u EQUALS ZERO 
n=4, m=6 


u Exact Kendall Pearson 
.088 .209 
.110 .175 
.132 .133 
.154 .097 
.073 
.198 .057 
.220 .043 
.242 .029 
.264 .020 
.276 .016 
.298 .011 
.320 .0072 é -015 


Some support for this conjecture is given by a comparison of the exact 
probabilities of various u values with those given by the two methods. For ex- 
ample, in Table 1 the exact probabilities for m=6, n=4 are compared with 
those arrived at by the Kendall criterion and by the Pearson chi-square with 
expectation 4m. It can be seen that the fit of the Pearson criterion is not so good 
as that of the Kendall for m and n this small. However, the fit does seem close 
enought to lead to the expectation of a satisfactory degree of approximation 
by the simpler method for large m and n. Computational simplicity might 
favor the Pearson chi-square in many instances of fairly large samples. 

On the other hand, when ranking methods are employed and the hypothesis 
that population 7=0 is to be tested, the problem becomes more difficult. 

Ehrenberg found that 


| 
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(m — 2)(2n? + 6n + 7) . 


should be distributed approximately as chi-square with 


” = 2)*(2n? + 6n + 7)? 


degrees of freedom. He further compared the approximate values provided by 
this chi-square statistic with exact probabilities obtained for small values of 
n and m. These comparative probabilities are reproduced (with minor cor- 
rections) as part of Table 3. It will be noted that the Ehrenberg chi-square 
necessitates interpolation for fractional degrees of freedom, so that only ap- 
proximate probabilities are shown here. 

In principle, it should be possible to find some statistic with form a7+b simi- 
lar to Ehrenberg’s, giving a reasonably good fit to the required null distribution 
for large m and n , but having the advantage of integral valued degrees of free- 
dom. The writer is indebted to the referees of this paper for the following 
suggestions. One possibility is to choose an integer number g of degrees of 
freedom near v, and then to choose a and b so that a+b has mean g and vari- 
ance 2g. This implies that 


(12) 


189 n\/{m 


Now as 


2 / (2n? + 6n + 7)? 


Expression 14 becomes of the order of n as n>. In fact, in the limit for m, 
with increasing size of n, vy as given in (14) appears to converge on n fairly 
rapidly (e.g., 3.32 for n=4, 4.45 for n=5, 5.57 for n=6, 9.72 for n=10, 20.14 
for n= 20, and so on). 

Thus it appears that a reasonable large sample approximation should be 
provided by 


+n (15) 
with 


335 
(14) 
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distributed approximately as chi-square with n degrees of freedom. 
On the other hand, the factor 


m(m — 1) 
(m — 2)? 


is not inconsiderable for small m, so that the approximation should be much 
inferior to the Ehrenberg approximation for small samples of judges. 

Asa rough check for these conjectures, and in order to see within what range 
of m the approximation (15) might be acceptable, the values of average tau 
required for significance at the 5% level were calculated, first for the statistic 
(15) and then using the Ehrenberg statistic. These are shown in Table 2, for 
various values of m and n. The values required by (15) with correction for 
continuity are shown as the upper entry in each cell, and those from the 
Ehrenberg chi-square, similarly corrected, as the lower entry. It is obvious 
that the fit is quite good throughout the table; in fact, the fit is surprisingly 
good even in the lower values of n and m. As a further check on the extent of 
this agreement between the two approximations, probabilities obtained from 
each were compared with the exact probabilities presented by Ehrenberg for 
the case of n=3 and m=4, n=3 and m=5, and n=4 and m=3. These appear 
in Table 3. It is quite apparent that the Ehrenberg probabilities agree better 
with the exact values than do those given by (15). Nevertheless, the probabili- 
ties given by the latter approximation are not as deviant as one might reason- 
ably expect. These results suggest that the simpler statistic may be useful in 
testing the significance of average tau, at least for moderate to large m and n. 
There is at least some computational convenience afforded by the integral 


TABLE 2. APPROXIMATE VALUES* OF AVERAGE TAU REQUIRED FOR 
SIGNIFICANCE AT THE 5 PER CENT LEVEL, USING BOTH 
azt+n (UPPER ENTRY) AND THE EHRENBERG CHI-SQUARE 
(LOWER ENTRY), UNDER THE NULL HYPOTHESIS 7=0 


15 


20 


* Corrected for continuity by the addition of 


| 
e | 4 5 6 7 s 9 10 
5 .333 .268 .198 .178 162 149 
.260 191 .173 .158 
10 .118 091 0816 .0715  .0693 
.137 096 .090 .0750 .0683 
m 
0960 0671 0593 0535 .0490 0454 
0944 .0657 .0585 .0525 .0484 .0450 
m | .0699 .0573 .0494 .0437 .0362  .0335 
.0566 .0493 .0453 .0393 .0360 .0332 
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TABLE 3. PROBABILITIES THAT VARIOUS VALUES OF AVERAGE 
TAU WILL BE ATTAINED OR EXCEEDED WHEN POPULATION 
AVERAGE TAU EQUALS ZERO 


n=3, m=4 
Using Ebrenberg chi-square (approx.) .130 

n=3, m=5 


T 
Using Ehrenberg chi-square (approx.) é .045 
Using at-+n i -048 

n=4,m=3 


T 
Using Ehrenberg chi-square (approx.) 4 .205 


valued degrees of freedom which it utilizes. On the other hand, for very small 
m and n, the Ehrenberg approximation is probably preferable. 


3. AN ANALYSIS OF AGREEMENT FOR SEVERAL GROUPS OF JUDGES 
Suppose that we had several groups of judges representing different condi- 
tions or experimental treatments, such that each judge provides a rank order 
of the same n objects. Let there be R different groups labelled (1, 2,---, 
r, +++, R), with m, different judges to a group r, and with T=m,+me+ --: - 
+mez in the total group. 
Now the overall degree of agreements in the total group can be computed by 


T(T — = - T. (16) 


( ) 


7,= the average agreement between judge pairs within group r 
7,,=the average agreement between judge pairs drawn from group r and 
group s. 


Then 


= > 


-22 
-273 
.20 
.214 
-33 

-132 
-10 
.102 
We define 
| 
+ ar | = 
er k r 
so that 
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TT 2 T1 2 T2 9 Tr 12 


18 
+ + rs. 


In short, we can partition the total “agreement” existing in the group as a 
whole into agreement between pairs within the same experimental group and 
agreement between pairs in different experimental groups. This is analogous, 
of course, to the partition of the sums of squares in analysis of variance. Ob- 
viously, by a similar argument, agreement in terms of W or average p may 
also be partitioned into agreement within and between groups. 

Utilizing the notation in 6 and 7 above, one finds that in the limit for 
group r 


(Bir — Gir)? 
(3) 


and for groups r and s 


If the population were homogeneous with respect to the agreement within and 
between groups (that is, if membership in the R respective groups had no 
influence upon the relative agreement to be expected between pairs of indi- 
viduals) then for any pair of groups, 1 and 2, for the population average tau, 7 


T1 = T2 = T12 
Furthermore, if this is the case in the populations, 
T1 Te = 2712. 


This means that 


2. (pia — Ja)? + (Piz — Ji2)? = 2 (Pir — Far) (Pi2 — 


Since for any given pair of objects 7 
(Pir — Gis)? + — Yaz)? — Jar) (Diz — 
then 
(Bir — Gis)? + (Pi2 — Fiz)? = 2(Pia — Gus) — 
if and only if pi = pi2; it thus follows that 7,=72= 7,2 if and only if 
Ba = Bir = Bi 


for every pair of objects 7 in the set of (3) such pairs, in the population. 
Thus, one might cast the hypothesis of homogeneity in the form that for a 
particular pair of objects 7, and for a particular arbitrary and constant ranking 
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TABLE 4, AGREEMENTS AND DISAGREEMENTS WITH AN ARBITRARY 
ORDER FOR THE EXAMPLE 


B 


ADS 


7 
17 
9 
8 
7 


Total Group 


of these two objects, p;, the probability in general of agreeing with the arbi- 
trary ranking is the same as f,,, the probability of a judge in population r 
agreeing with the arbitrary ranking. In other words, we form the hypothesis 
that 


Ho: Bi = Par Pir, for all 1. 


The ability to frame the hypothesis in such terms tempts one to the con- 
jecture that a relatively simple chi-square statistic might serve to test this 
hypothesis of equal agreement within and between groups. This is, unfortu- 
nately, at this stage only a conjecture, and further work is required before the 
feasibility of such a test of “between” and “within” agreement becomes 
apparent. 

There is, incidentally, no reason why the partition is limited to average 7 
based on rankings. Exactly the same scheme may be followed with wu for 
paired-comparison data, and the hypothesis of equality of between and within 


A _| | C D E F 
_ 3 11 6 14 14 
13 _— 12 14 16 15 
5 4 _ 6 11 13 
- 10 2 10 _ | 14 12 
2 0 5 2 — 8 
2 1 3 4 8 —_— 
Group I 
A B c D |_| F 
—- 2 2 1 3 5 
14 _ 9 3 8 11 
14 — “4; 3 8 9 
15 13 13 _ 10 13 
13 8 6 
11 5 7 3 9 _— 
Group II 
A B | C |_| E F 
A — 5 13 17 19 
B 27 —_ 21 24 26 
Cc 19 11 19 22 
D 25 15 23 24 25 
E 15 8 13 _ 15 
F 13 6 10 17 -—~ 
if 
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population agreement may be framed in the same way. Here too, it seems 
reasonably safe to assume the potential exists for a relatively simple test. 
4. AN EXAMPLE 


Given two groups of sixteen judges each, their rank-orders of the same six 
objects are: 


ABDCFE 
ACFBDE 
BACFDE 
BEADFC 
BAEFDC 
CBFDAE 
CDBAFE 
CDABFE 


BADECF 
BDAECF 
BDACEF 
BDACFE 
BCDAEF 
BDCFAE 
BDEAFC 
BDACEF 


DCBAFE 
DFCBEA 
DBFCAE 
DBFCEA 
DBECFA 
FDEBCA 
DFECBA 
BFDCEA 


EBDACF 
ABDCFE 
EDBCAF 
EDCBFA 
ECFDBA 
CEDBFA 
DBFCEA 
ECDAFB 


I II 


The problem is to measure the agreement among judges within each group, 
and to compare agreement within and between the two groups. 

We being by arranging these data into three tables; one each for group I, 
group II, and total. Each table entry shows the number of judges for whom 
the row letter comes before the column letter in the rank order for the judge. 
The total table is simply the cell-by-cell sum of the other two tables. These 
tables are shown here as Table 4. 

Let x; be the number in a cell above a main diagonal. From the tables, the av- 
erage 7 measures are computed as follows: 


TI 


— D(n)(n—1) — 1 
7 (6 — 16)? + (22 — 16)?+-:-:- + (16 — 16)? | ae 
7 (120) (30) 


= 16)? + 4 = 16)? + (14 16)? 
(120) (30) 
(10 — 32)? + (26 — 32)? + + (80 32) 


(496) (30) 32 


= = 175. 
Then, from expression 18, average + between members of different groups is 
given by: 
(496) (.175) — (120)(.340) — (120) (.207) 
= = .0828 
(16) (16) 

Note that the “arbitrary order” becomes, in this case, simply the natural 
order of the letters of the alphabet. The average tau between 32 judges and 
this arbitrary order is, from 4 above. 


l 
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2(263) 

32(15) 


If, however, the interest lies in finding a rank order which is a “best fit” to 
those given by the judges, in the sense mentioned above, permuting the rows 
and columns of the matrix simultaneously until the sum of the entries above the 
diagonal is maximal gives the required order BDCAFE to the rows and 
columns, with 


max 7c = .354. 


The orders BDACEF and BDACFE fit group I equally well (max 7¢ =.558) 
while the order DBCFEA is a best fit for group II (max 7¢=.417). 
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ESTIMATION IN THE TRUNCATED POISSON DISTRIBUTION 
WHEN ZEROS AND SOME ONES ARE MISSING* 


A. Ciirrorp CoHEN, JR. 
University of Georgia 


This paper is concerned with maximum likelihood estimation of the 
Poisson parameter \ when the zero class has been truncated and when 
errors of observation have resulted in eliminating some though not 
necessarily all of the one class. Estimators are derived both for \ and 
the proportion @ of ones eliminated as a consequence of faulty observa- 
tion. A table and a graph of the estimation function involved are given. 
Asymptotic variances and covariances of the estimators are derived, 
and a table of the variance function applicable in calculating Vi) is 
given. An illustrative example is included. 


1. INTRODUCTION 


N SAMPLING from a truncated Poisson distribution with missing zero class, 
i] errors of observation sometimes result in a further partial truncation of the 
one class; that is, some of the ones are in effect eliminated from the population 
being sampled. Suppose for example, the number of defects in a manufactured 
item is a Poisson distributed random variable and that only items free of defects 
are acceptable. Suppose further that it becomes necessary to estimate the mean 
number of defects produced per item (the Poisson parameter) from a sample 
selected entirely from rejected items accumulated during inspections, and that 
the number of acceptable items inspected is not known. Inspections may or 
may not have been performed on all items produced. Because of certain 
psychological inspector bias, some few items containing “only one defect” may 
have passed undetected and thus have been eliminated from the population 
being sampled. We assume that similar errors of observation did not occur with 
respect to items containing two or more defects each. 

As another example, consider the similar problem of estimating the mean 
number of insect eggs per nest where a nest can be recognized as such only 
if one or more eggs are present, the number of eggs deposited per nest being a 
Poisson random variable. When faulty observation results in overlooking some 
of the nests containing only a single egg, while nests containing two or more 
eggs are correctly observed, the resulting sample is also of the type described 
in the preceding paragraph. 

Let x designate the random variable under consideration. With \ designating 
the Poisson parameter and @ the proportion of ones eliminated as a conse- 
quence of faulty observation, the probability function may be written as 


(0, z=0, 
(1 — 
t= 
p(x;,0) = [1 — + @)] (1) 
en? 
, x = 2, 3, P 
[1 — + ar) Jz! 


* Sponsored by the Office of Ordnance Research, U. 8. Army. 
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where \>0 and 0<@<1. When @=0, we have the truncated Poisson distribu- 
tion from which the zero class is missing, and when @= 1, we have the truncated 
Poisson distribution from which both the zero and the one classes are miss- 
ing. 

In this paper, we obtain maximum likelihood estimators for the parameters 
d and @ plus their asymptotic variances and covariances. The techniques em- 
ployed and the results obtained here are closely related to previous studies of 
the truncated Poisson distribution by David and Johnson [3], Finney and Var- 
ley [4], Moore [5], Plackett [6], Rider [7], Tate and Goen [8], and this 
writer [1], [2]. 


2. MAXIMUM LIKELIHOOD ESTIMATION 


Consider a sample consisting of N observations of random variable x. The 
likelihood function is 


P(x, ++ + 9) = {- — + Or) a![1 — + 


(2) 


where n; is the number of ones in the sample, and II* is the product over all z’s 

that are greater than one. Taking logarithms of (2), differentiating with respect 

to \ and @ in turn, and equating to zero leads to the estimating equations 
mi/N = (1 — — e(1 + @)], 


= (& — O(/(1 — 6), 


where Z is the mean of the entire sample 


z= x;/N. (4) 


The required estimates, which we designate \ and 6 to distinguish them from 
the parameters estimated, may, when they exist, be found by simultaneously 
solving the two equations of (3). Solving the second equation of (3) for 6 yields 


= [N& — /n*z*, (5) 


where n* is the number of sample observations greater than one (n* =N—n), 
and #* in their mean; that is, 


#* = / (6) 


in which n, designates the sample frequency of x. On substituting (5) into the 
first equation of (3), we obtain 
b(A) = (1 — e)/[1 — = (7) 


Since Z* involves only the n* sample observations for which «>2, estimating 
equation (7) derives no information from the ones in the sample. In other 


344 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


words, when some unspecified proportion of the one class is known to be 
missing, that entire class is disregarded in calculating the maximum likelihood 
estimate of X. 

As derived above, estimating equations (5) and (7) are applicable for 
0<6é<1. However, it is relatively easy to derive the maximum likelihood esti- 
mating equation for a truncated Poisson distribution from which both zero and 
one classes are entirely eliminated and thus demonstrate that (7) is also ap- 
plicable in the special case when @=1. 

To facilitate solution of (7), Table 1 was compiled with b(\) = z* given as a 


TABLE 1. MAXIMUM LIKELIHOOD ESTIMATION FUNCTION FOR 
TRUNCATED POISSON DISTRIBUTION FROM WHICH ALL ZEROS 
AND SOME OR ALL ONES ARE MISSING 

= (1 —e>)/[1 —e*(1 +a) ] 


0.2 0.3 0.4 0.5 0.6 0.7 0.8 


2.0691 2.1051 2.1425 2.1810 2.2208 2.2617 2.3040 
2.4856 2.5342 2.5841 2.6354 2.6880 2.7418 2.7970 
3.0309 3.0926 3.1556 3.2198 3.2853 3.3521 3.4200 
3.7036 3.7774 3.8522 3.9281 4.0050 4.0830 4.1619 
4.4869 4.5703 4.6546 4.7396 4.8254 4.9119 4.9991 


5.3545 5.4448 5.5356 5.6270 5.7189 5.8112 5.9040 

6.2792 6.3739 6.4689 6.5642 6.6599 6.7558 6.8520 

7.2389 7.3362 7.4336 7.5313 7.6290 7.7270 7.8250 

. 8.2185 8.3172 8.4159 8.5147 8.6136 8.7126 8.8117 

9.0100 .1093 9.2086 §=©9.3079 9.4073 9.5067 9.6062 9.7057 9.8053 


10.005 10.204 10.304 10.403 10.503 10.603 10.703 10.802 
11.002 ‘ 11.202 11.302 11.401 11.501 11.601 11.701 11.801 
12.001 . 12.201 12.301 12.401 12.501 12.600 12.700 12.800 
13.000 . 13.200 13.300 13.400 13.500 13.600 13.700 13.800 


Instructions: Enter table with #* =0(A) and obtain { by inverse linear interpolation. 


function of \. With #* available from the sample data, inverse linear interpola- 
tion in this table immediately yields the required estimate 4. A graphic aid 
that is quite easy to use and which is almost as accurate as inverse linear inter- 
polation in Table 1 has been included as Figure 1. This is a folded scale graph 
of 6 as a function of #*, where 4 is a correction to be subtracted from #* in esti- 
mating \. By definition, 5=Z*—,, and with 6 rather than 2d plotted as the 
ordinate, it has been possible to improve the accuracy attainable in a chart of 
fixed size. 

It is not difficult to construct samples for which (5) and (7) fail to yield ac- 
ceptable estimates of \ and @. However, for large N such samples are quite im- 
probable when probability function (1) applies, and their occurrence in practical 
applications should be interpreted as a suggestion that some probability func- 
tion other than (1) is needed as a model to describe the random variable ac- 
tually under observation. 


3. SAMPLING ERRORS 


The asymptotic variance-covariance matrix of (4, 6) is obtained by inverting 
the matrix consisting of elements that are negatives of expected values of the 


q 

| 0.0 0.1 0.9 
i) 0 2.0000 2.0337 2.3475 
1 2.3922 2.4382 2.8535 
‘Sais 2 2.9114 2.9795 3.4892 
te 3 3.5595 3.6310 4.2418 

7 4 4.3226 4.4043 5.0870 

he 5 5.9972 

6 6.9484 

7.9232 

q a 8 8.9108 
9.9049 

10 10.902 
11.901 
12 12.900 

13 13.900 
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ees! 


Fig. 1. Maximum likelihood estimation chart for truncated Poisson distribution 
from which all zeros and some or all ones are missing. 


Instructions: Select applicable segment of horizontal axis and locate point corresponding to ple value of 
2*, Extend projection vertically from this point to graph segment lettered to correspond with the horizontal axis 
segment. Project intersection thus determined horizontally to read § on vertical scale. Compute k =#*—3. Example: 
Given =2.94, read 3 =0.89, and =2.94 —0.89 =2.05. 


second order derivatives of logarithms of the likelihood function P. Thereby 
asymptotic variances and covariances follow as 


V(A) ~ 22/N — $12), 
Cov (A, 6) ~ $12/N (1122 — 


and the coefficient of correlation between \ and 6 as 


6 

2.0 2.2 2.4 2.6 2.8 3.0 3.2 3.4“ 
3.4 $6 3.8 4.0 4.2 4.4 4.6 
4.8 5.0 5.2 5.6 5.8 6.0 $02 © 
a) 714 77.6 ° 
(9) 
where 
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\ an? {1 — e-*(1 + 0a) ]? 
- ao) 
N [1 + N 
1 aL ) — + d)] 
N \oe/ 
In accordance with common usage, ¢;; is an abbreviation for ¢;; (A, 4), E( ) 


denotes expected value, and L is written for 1 in P. 
On substituting the ¢;; from (10) into (8) we obtain 


VA) ~ [\/E(n*)]¥Q), (11) 


where 
E(n*) = — = — +))/[1 + )]. 
For N sufficiently large, V (A) can be approximated using (11) with parameters 


replaced by their estimates. From (12) and the first equation of (3) it follows 
that 


E(n*) = N[1 — e*(1 — + = n’*, (13) 
and accordingly 
Va) = (14) 


On the basis of the derivation given here, (11) is valid for 0<6@<1. If, however, 
6=1, in which case all zeros and ones are missing from the distribution, it can 
be shown that (11) is applicable in that case also with E(n*) =N =n*. 

The variance function is continuous, monotonic decreasing, lim, = 3, 
and limy..¥(A)=1. Therefore regardless of the value of \, the asymptotic 
variance satisfies the inequality 


d/E(n*) < VA) < 3d/E(n*). 


TABLE 2. THE VARIANCE FUNCTION 
v(a) = [1 —e(1 +a) ]2/[(1 —e)2 ] 


~ 


» 


. 230486 
.170777 
125480 
.066001 
-033527 
.016476 
.007857 
-003649 
-000060 


.333439 
266571 
208412 
.926785 
.710239 
-541391 
-410145 
-308605 


| 


3.000000 
902293 
.809070 
. 720123 
.635274 
.554347 
-AT7174 
.403590 


o 


Noor 


 & 


le 
ae. 
346 
Be 
(15) 
1 
2 
~ 
; 2 
3 
? 


ESTIMATION IN TRUNCATED POISSON 347 


As a computational aid for use in practical applications, selected values of 
¥(A) are included in Table 2. 


4, AN ILLUSTRATIVE EXAMPLE 


To illustrate the practical application of results obtained in this paper, an 
example considered by Finney and Varley [4, page 388] has been suitably 
altered. The example chosen is a distribution of the number of gall-cells pro- 
duced in flower-heads of the knapweed by larvae of the knapweed gall-fly, 
observed by Varley [9] in 1936. For purposes of this illustration 30 of the zero 
observations are omitted from the original data. Both the original and the 
altered data together with expected values of the altered data are given in the 
accompanying table. 


DISTRIBUTION OF GALL-CELLS IN FLOWER-HEADS 
OF BLACK KNAPWEED 


Altered Data 


No. Gall-cells Original Sample 
Frequencies t Observed Expected 
Frequencies Frequencies 
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t Observed by Varley [9] in 1936. 


For the altered data, N = 259,n,; = 60, n* = 199, NZ = Tf an, = 645, 
= Z=2.22414, 2.93970, and =0.09302. On entering 
Table 1 with the above value of 2*, inverse linear interpolation yields \= 2.048. 
The chart of Figure 1 can be employed to read §=0.89, and to two places of 
decimal we calculate 2.9397 —0.89 = 2.05. 

Had we chosen to neglect the 60 observations of the one class, our estimate 
of \ based on the remaining truncated sample from which both the zero and 
the one classes are missing would be unchanged. Had we chosen to disregard 
6 or more precisely to assume @=0, which is equivalent to considering the 
altered sample with €=2.22414 as being truncated with only the zero class 
missing, the procedure described in [2] gives }= 1.887. This same procedure 
gives \= 2.028, which agrees with the estimate obtained by Finney and Varley 
(loc. cit.), when calculations are based on the original unaltered sample. With 
replaced by }=2.048 in (5), we calculate 6=0.3075, which estimate is to be 


| | 
7 90 60 
96 96 
57 57 60 
26 26 31 
10 10 12 
4 4 ( 4 
5 5 1 
0 0 
1 
I 0 | 0 
| | | 4.71 
P | 0.196 


348 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


compared with 0.3333, the proportion of ones actually eliminated from the 
original sample in construction the altered sample. 

As indicated by the low value obtained for x?, the agreement between ob- 
served and expected frequencies in the altered sample is satisfactory. With 3 
degrees of freedom, P(x?>4.71) =0.196. The value x%)=4.71 for the altered 
sample is to be compared with x%)=4.87 obtained by Finney and Varley in 
comparing frequencies of the original sample with expected frequencies based 
on the unaltered sample estimate of X. 

As a preliminary step in calculating variances and covariances of estimates 
obtained here from the altered sample, we employ (10) with \ and @ replaced 
by their estimates to calculate $::=0.381352, ¢2=—0.243408 and 
$22 = 0.371140, where is written for ¢;;(4, 6). On substituting these values 
into (8) and (9), we calculate V(A) =0.0174, V(6) =0.0179, Cov (6, 4) =0.0094, 
and pt3=0.647. For comparison, we also calculate V(X) using (14). Linear 
interpolation in Table 2 gives ¥(2.048) = 1.69 and from (14) we have 


V (A) = [2.048/199 ] (1.69) =0.0174, 


as before. For the estimate based on the original unaltered sample consisting 
of 289 observations, Finney and Varley computed V (4) =0.0088. 
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Efficient Conversion of Non-Metric Information into Metric Information. Ronert P. Apetson, Yale University. 


Qualitative information about sets of entities is often via inequalities (rank order scale, ordered metric scale, 
etc.). In the case where the entities are identified as levels of an independent variable, a maximum procedure may 
be used to determine an efficient set of numbers (the maximin sequence) to represent the levels. Goodness of fit is 
assessed by the square of the lowest possible product t correlation coefficient between the maximin sequence 
and a sequence in the convex set defined by the inequalities. 

Results are presented for rank order, “diminishing returns,” and ordered and higher-ordered metric scales. 
The values of r? are higher than one might expect, but lower than might be ideal. The more qualitative information 
one has (the stronger the inequalities), clearly the higher the r? value. The art of developing strong yet appropriate 
inequalities from available information is discussed. 


A Differential Method for Estimation of Type Frequencies in Triplets and Quadruplets. Gorpon ALLEN, National 
Institutes of Health. 
Bertillon and, later, Weinberg devised a “differential” method for estimating the proportions of identical and 
fraternal twins in any large series. An analogous method appears applicable to triplets and quadruplets when ac- 
curate data are available for very large populations. In addition to the fraction of sets concordant for sex, sufficient 
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in the case of twins, the method for triplets and quadruplets employs the calculated proportions of each type in the 
next lower order of birth. Thus, the calculations for triplets employ the proportion of mixed-sex triplets and the 
proportion of two-egg twins. These data are entered in a set of equations derived by combination and expansion 
of binomials. Errors inherent in the method seem to be tolerable for present purposes. 

When applied to recent U. 8S. birth statistics, the method indicates that the ratios of one-egg, two-egg and three- 
egg triplets are approximately 1:3:2 among Negroes, and 1:2:1 among Whites. The four ovularity types of White 
quadruplets occur approximately in the proportions 2:6:5:4. 


Irreversible Absorption Processes as Models for Failure with Applications to Life Testing. Wm.iam R. ALLEN, 
Princeton, N. J. 


Irreversible Absorption Processes with a finite number of states are studied as models representing mechanisms 
with many modes of failure. Special examples of such processes lead to mixed distribution models of Kao (Trans. 
IRE PGRQC-7, 1956) and Mendenhall (Proc. of Stat. Tech. in Missile Evaluation Symp., VPI, 1958), models simi- 
lar to those of Epstein (IQC, 1958) and Birnbaum and Saunders (JASA 53, 1958), Triggered Mechanism models, 
representations of life test experiments, representation of the failure of complex equipment and equipment deterior- 
ation models. A problem concerning accelerated life tests with many contributors to the hazard (Allen, Operations 
Research, Vol. 7, 1959) is examined in the light of this type of model. 


Trends in Productivity in the 14 Major U. S. Manufacturing Industries 1955-1959. Ta A. AN 


California Dept. of Finance. 


This paper analyzes the increases in productivity from the lst quarter of 1955 to the 1st quarter of 1959 for each 
of the twelve major manufacturing industries in the U. 8. economy. Each industry was analyzed in the following 
respects for this period. 

1. Changes in output 

. Changes in employment and manhours worked 

. Changes in productivity (output per inan hour of employment) 
. Changes in net plant and equipment per production worker 

. Changes in hourly wage rates for production workers 

. Changes in prices 

. Changes in the ratio of profits to net worth 

It was found that a closer torrelation existed bet price mov ts and changes in unit production costs 
than between prices and output. This suggests that under certain fairly broad situations shifts in costs are a greater 
influence on price than shifts in demand. Where unit costs are rising and demand is falling, prices usually will be 
raised. Where demand is rising by no more than 6% or 7% per year, and costs are stable, prices usually can be 
expected to be relatively stable. 

There are many reasons why shifting costs are relatively more important today than changes in demand as an 
influence on prices in most manufacturing industries. These reasons are set forth in this study. 


Classification into Multivariate Normal Distributions with Unequal Covariance Matrices. T. W. ANDERSON, 
Columbia University, anp R. R. Banapur, Indian Statistical Institute. 


Suppose a vector z has a multivariate normal distribution with mean vector wi and covariance matrix Li or 
mean vector uo(* 1) and covariance matrix D2: .We consider linear procedures for classifying an observation vector 
z as coming from one or the other distribution. Such a procedure is specified by a vector b and a scalar k; the rule 
is to classify from the first population if b’r >k. The admissible procedures within this class are of three types. One 
of these (including the minimax procedure) is generated by vectors of the form (p2:-+g22)~!(ui —u2), where O<p, 
q<l and p+q=1; the other two types are characterized similarly. For each vector there is a corresponding k. 
Some computational procedures are outlined. 


Factors in Labor Force Growth. Gertrupe Bancrort, Bureau of Labor Statistics. 


This paper summarizes briefly what has been learned about the factors affecting labor force growth, by means 
of the accumulation of 20 years of evidence from the monthly sample survey of the population and two decennial 
censuses. Some of the demographic factors not known or not measured 20 years ago include age of a woman's 
children, age at marriage, suburbanization of the population, population shortages in particular age groups, school 
attendance, and educational attainment. Other factors include availability of suitable employment, particularly 
part-time employment, income from sources other than earnings, and a complex of social and psychologicai influ- 
ences. Year-to-year changes in the size of the labor force have been far from uniform. Periods of rapid expansion, 
associated with mobilization or peacetime booms, have been followed by relatively slow periods which were not 
restricted to the recession phase of the cycle. Despite the accumulation of data over the past two decades, there 
is very lictle direct evidence on why specific types of individuals enter—or fail to enter—the labor force, or why they 
do—or do not—remain labor force members. Until statistical research and data collection provide these answers, 
interpretation of the magnitude of labor foree changes and their underlying causes will remain speculative and 
controversial. 


The Organization of Statistical Instruction in Colleges and Universities. T. A. Bancrort, Iowa State University. 


The form of the organization of statistical instruction in colleges and universities should depend upon such 
factors as the type, size, interests and needs of the particular college or university in question. Many smaller col- 
leges or universities may find that it is not feasible to organize a separate department of statistics. In such cases a 
departmental home should be found for at least a general principles course in statistics at the freshman or sophomore 
level and a beginning general course or courses in theoretical statistics with calculus as prerequisite. This latter 
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course, in such cases, would logically be offered in the department of mathematics. In addition more specialised 
beginning and advanced methods courses would naturally be offered, in such cases, in the respective departments of 
the specializations. Intermediate and larger size colleges and universities are now tending more and more to the 
offering of statistical instruction in independent departments of statistics and in many cases these departments are 
related to a statistical laboratory or institute. Such departments of statistics offer or expect to offer programs leading 
to the B.S., M.S., and in many cases the Ph.D. degree with major in statistics in addition to service courses for 
other departments. 


An Evaluation of Social Area Indices. James M. Besuers, Purdue University. 


The procedures and results of Shevky, Bell and Tryon are evaluated with an analysis of Cleveland 1950 data. 
Factor analysis and simplex analysis are used to examine the correlation matrix. Detailed relationships between 
variables and constructs are revealed which contribute to theoretical interpretation of the derived constructs. An 
alternative approach to index construction is considered. 


The Construction Statistics Program. Eumer Ciark Bratt, Lehigh University. 


A new program to provide construction statistics is getting under way. General supervision has been placed 
with the Bureau of the Census, where the work has been assigned to the Construction Statistics Office. 

An extensive re-examination of existing series and analysis of longer-run objectives is under way. Initial 
steps taken relate to experimental work on the measurement of housing starts and of residential repair and main- 
tenance. 

Looking toward better methods of current estimation new technqiues are being idered. For inst: , the 
checking of housing starts by aerial photos and by such records as electrical connections has gotten under way. 
A complete restudy of the phasing out of contracts and starts is planned for the near future. The potentialities of 
progress reporting as an alternate method of measuring put-in-place will also be tested. 

Such problems as shifting boundaries in geographical areas, changes in practices on inclusion of equipment with 
the structure, and shifts in proportion of activity between on the site and in the factory are leading into an analysis 
of the most basic problems. The very definition of construction is thereby being questioned. 

Pressing current problems, such as what to do about discontinuity which may creep into series with the use of 
new definitions and of better measurements are harrying. After decisions with respect to these problems are made 
there remain other problems such as measurement of the price of construction and of seasonal changes in it. 


Experimental Design in Weather Modification Experiments. Gurnn W. Brier AnD Hers Tuo, U. S. Weather 
Bureau 


The modern era of experimentation in weather modification began with laboratory “cold box” tests in 1946 
that were soon followed by successful attempts to produce visual changes in supercooled clouds by scattering dry 
ice from airplanes. These led to numerous amateurish and poorly designed experiments from which extravagant and 
unsupportable claims were made and subsequently exploited commercially. The confusion and violent controversies 
that resulted tended to draw attention away from the more sound scientific experimentation that was going on and 
which indicated a growing recognition of the importance of modern design principles. Against this general historical 
background, this paper discusses the progress that has been made on the problems connected with weather modifica- 
tion experiments and points out some of the difficulties and uncertainties that remain. 


The Decade Ahead in Shareownership Research. JonaTtHAN A. Brown, New York Stock Exchange. 


Paper wil! discuss seven areas in which shareownership research is expected to develop in coming decade. 

1. “Alpha-Seg” technique developed by New York Stock Exchange has proved sufficiently satisfactory to be 
used for future “nose-counts.” 

2. Data on personal characteristics of shareowners—i.e., age, education, income, and occupation—should be 
improved, however. 

3. Dollar ownership by income groups and/or proportionate ownership by decile groups is admittedly an 
important but difficult area. Progress may necessarily be limited to studies of a group of companies rather than all 
public corporations. ‘ 

4. More corporations will research their own shareowning families—such as the Ford and General Electric 
studies of recent years. 

5. Expect more research on Indirect Ownership—even beyond the SEC work on Pensions, and Mutual Funds; 
and the American Bankers Association studies of Personal Trusts. 

6. Increased attention will be given to the economic, political and social implications of broader shareownership 
(direct and indirect). 

7. Studies of shareownership in foreign countries—South America (Colombia) to Australia; South Africa 
(Johannesburg) to West Germany—will expand. 

Conclusion: Although shareownership research has progressed considerably in the past decade, unprecedented 
developments will occur in the next—with a large number of individuals and organizations contributing. 


Publications of the 1960 Census. Howarp G. Brunsman, Bureau of the Census. 


As in 1950, a series of population reports will be iseued for each State, and the content of the separate reports 
for a State will be shown in a bound part of the final volume. To expedite their publication, the 100 per cent data 
on general population characteristics will be presented in separate reports. The general arra t of the housing 
reports will also be similar to that for 1950, with statistics by blocks in a separate report for each of the larger cities, 
the general characteristics for smaller areas contained in State reports, and the more detailed cross-tabulations in 
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a separate report for each of the larger standard metropolitan statistical areas, and for rural housing by 120 sub- 
regions. 

The following new subjects will be covered in the reports of the 1960 population census: place of work of em- 
ployed persons in terms of cities of 50,000 or more and counties, means of transportation to work, whether persons 
attending school were attending public or private school, duration of residence in present quarters, date of first 
marriage, year last worked, veteran status of males (collected but not published for 1950), and language spoken in 
home by foreign-born persons before coming to the United States. Statistics on place of previous residence will relate 
to 5 years prior to the census, instead of to 1 year, as in 1950. ; 

The new housing subjects include: number of bedrooms, number of bathrooms, presence of a basement, home 
food freezers, clothes washing machines, dryers, telephones, air conditioning, and ber of aut biles; also, 
presence of elevators for units in larger cities and source of water supply and type of sewage disposal for units in 
smaller cities and rural areas. 

As in 1940, but not in 1950, the reports will include limited data on color, sex, age, marital status, and house- 
hold relationship for minor civil divisions. 

The published statistics for blocks in larger cities will include total population and average number of rooms 
in owner- and renter-occupied units, in addition to most of the subjects covered in 1950. Separate reports of the 
1960 housing census will contain the results of the supplementary surveys on the components of change in the 
supply of housing and on characteristics of financing of residential properties. 

It is expected that use of the electronic computers will expedite publication of the various reports, with the 
population reports of 100 per cent data and housing statistics for larger cities by blocks appearing early in 1961 
and the entire regular series completed by late 1962. 


Application of Linear Regression to Frequency Graduation. Ropert J. Buraier, Iowa State University. 


F. Y. Edgeworth, N. L. Johnson, and others have considered the problem of graduating an obrerved frequency 
distribution of a variate z by choosing parameters 6 of a function y = ¢(z7; 8) in such a way that y has approximately 
a normal distribution. In this paper the normal density is replaced by an arbitrary standard density g(y) and ¢ is 
given the special form (2; 8) = 28i¢;(z). The observed cell frequencies determine percentage points y; of the stand- 
ard density g(y); and if the true density of z has the parametric form implied by the choices of g and ¢, then for 
large samples y; tends in probability to ¢(2;; 8) where the z;’s are class boundaries. Regression of the observed per- 
centage points yj on the theoretical ones ¢(z;; 8) provides a method of estimating the parameters 8;. The variates 
yj are highly correlated, but from the known covariance structure of the multinomial distribution the covariance 
matrix of the y;’s can be estimated for use in the regression analysis. A numerical example is given of a cubic trans- 
formation to normality. 


‘The Use of Statistics in the Formulation and Evaluation of Broad Social Programmes. Octavio CaBe.io, United 
Nations Statistical Office. 


Social statistics are defined as the statistics required for social programmes, and a social programme as the 
scheme of public and private activities which have a direct bearing on a particular aspect of living conditions. It is 
assumed that there is a separate programme for each component of the levels of living, e.g., health, education, 
housing. 

The uses of statistics for the following aspects of social programming are examined and illustrated with examples 
from the field of housing: Estimating of economic resources for the programme; measurement of living conditions by 
statistical indicators; estimating of social needs, e.g. “dwellings,” to maintain or improve the levels of living of 
growing populations; testing the consistency of social needs and economic, human and material resources; and adop- 
tion of goals for social programmes consistent with expected resources. 

Several forms of evaluation are discussed, and the indicators of levels of living are considered to be the only 
comprehensive yardsticks of the success or failure of social programmes. 

The purpose of the paper is to provoke dissemination of information on this subject among under-developed 
countries. : 


Migration and Employment. Rosert O. Carveton, Puerto Rico Planning Board. 


Several unexpected features of Puerto Rican migration have been uncovered by a new sample survey instituted 
by the Commonwealth Government three years ago. 

Rising wage levels in Puerto Rico combined with a drastic reduction in travel time and travel costs have made 
possible for perhaps the first time in history a large-scale, long-distance migratory movement of low income people 
in which the participants are able to travel first and decide afterwards whether their purpose was to migrate or just 
to visit. An estimated 70% of the 400,000 or so trips made annually between Puerto Rico and the United States 
in the past two years were by persons whose eventual purpose was either to migrate or to visit relatives. 

The survey discloses the existence of heavy migratory currents (even when seasonal and other migration lasting 
less than a year is excluded) in both directions between Puerto Rico and the United States. Both the in and the 
out movements were larger than the substantial net migration. The analysis of net migration and its projection into 
the future is significantly affected by the discovery that it is the resultant of two strong and opposing migratory 
forces instead of roughly the equivalent of out-migration. 


Use of Standard Metropolitan Statistical Areas in Government. Viapimirn D. Cuavrip, anp Gasrie. CHERIN, 
Bureau of Employment Security. 


The growth in the acceptance and use both by government and private groups of the Standard Metropolitan 
Statistical Areas as a basic geographic unit for the compilation, presentation, and analysis of statistical data on an 
area basis are attributable to several factors. First, the SMSAs made possible the compilation of a wide variety of 
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social and economic statistics on a current and comparable geographic basis. Second, because the area boundaries 
were established so as to encompass both places of industrial concentration (labor demand) and of population concen- 
tration (labor supply), more meaningful analysis of local economic and social conditions is possible. 

Any concern with economic conditions and programs to meet specific needs must take into account the variations 
in local conditions. The widespread use of the SMSAs developed by the Bureau of bared Budget, in cooperation with 
several government agencies, has made it possible to relate, on a comparable geograp | basis, statistics on popu- 
lation, labor force, employment, unemployment, hours of work, earnings, labor turnover, industrial production, 
housing, vital statistics, and many other social and economic activities. The major producers of such data are 
primarily government agencies, such as the Bureau of the Census, U. 8. Department of Commerce, the U. 8. De- 
partment of Labor’s Bureau of Labor Statistics and the Bureau of Employment Security, with its affiliated State 
employment security agencies. Some private organizations also develop area statistics, usually for marketing pur- 
poses. 

The government agencies are also the major users of the SMSAs. The Bureau of Employment Security and its 
affiliated State employment security agencies make extensive use of area data for both administrative and program 
purposes. Similarly, the SMSAs play an important part in the Federal Housing Administration’s analyses of local 
housing markets and in the work of the Office of Area Development of the U. 8. Department of Commerce in assist- 
ing local communities to improve and expand their economies. Other government agencies included in the list of 
users of the SMSAs are the Federal Reserve Banks, the Federal Communications Commission, and various State 
and local planning groups. Considerable use is made of the SMSAs by such private groups as local chambers of com- 
merce, the advertising industry, and various market research organizations. 

In summary, it may be noted that as a “statistical tool,” the SMSAs have made a major contribution in the 
field of statistics in that they make possible more ingful and extensive use of statistics on an area basis. 


Competing Risks and the Follow-up Study. Cain Lona Cutana, University of California, Berkeley. 


In a mortality analysis by cause, three types of probabilities of death may be explored: The net probability, 
the crude probability (both in the sense of Fix and Neyman), and the partial crude probability (the probability of 
death from a specific cause when a second cause is eliminated). Estimators of these probabilities are obtained for 
competing risks problem and for the follow-up study. Asymptotic variances and covariances of the estimators are 
also derived. 

Consider for example the follow-up study for death from all causes. Let z be the years since patient's admission 
to the study and p,; the probability of surviving the interval (z, +1). The Nz patients starting the interval are 
divided into two groups. A group of m admitted more than z+1 years before its closing date, of these s survive the 
interval. A second group of n admitted less than z+1 years before its termination are all counted as withdrawals 
in (zx, +1). Of these w survive to and E die before closing date. The maximum-likelihood estimator is 


with ¢pz =p,(1—pz)/Mz, where Mz =m+n(1+p,"2)-1. 


Method for Measuring Transportation of Passengers and Commodities at National Level. Donautp E. Crurca, 
Bureau of the Census. 


The “books-of-account” barrier has been broken by new concepts and techniques. This started with the one- 
per cent (rail) waybill sample, and was extended by “Shipper Surveys” based on invoices at shippers’ offices, travei 
and auto use surveys based on interviews at the household level, and truck inventory and use surveys based on 
reports for individual! trucks traced by license plates. 

This new approach starts with microscopic detail, such as an individual sales invoice. Its succezss depends upon 
full cooperation of respondents, effective use of probability sampling, and carefully controlled procedures. Coopera- 
tion has been excellent—in recent Census Bureau surveys, only 4 out of 600 companies refused to supply shipper 
information, and only 7 out of 24,000 households declined to answer travel questions. Major technical problems 
have been solved, and new devices have been developed, such as the “LOCAD” System for processing geographic 
and distance aspects. 

Now that survey methods have been tested for collecting statistics on most aspects of transportation, the em- 
phasis has shifted from methodology to operation. Further progress depends upon the joint effort of users and collec- 
tion agencies to determine the specific types of statistics needed and to develop more effective ways for using the 
results 


Some Problems in Multivariate Classification Using Discrete Variates. W. G. Cocuran, Harvard University, AND 
C. E. Hopkins, University of Oregon. 


Specimens are to be classified into one of two universes U, U’, using variates each of which takes only a finite 
number of states (e.g. —, +). If there are k variates and the j-th variate has m; states, the set of variates classifies 
each specimen into one out of m =m; distinct states. 

An optimum rule for classification is obtained as a particular case of Rao’s general method. A specimen which 
falls into the i-th state should be classified U or U’ according as xcpj > or <x’c’p; where x, x’ are the relative fre- 
quencies with which specimens from U, U’ present themselves for classification; c, c’ are the costs of misclassifica- 
tion; p;, p, are the probabilities that a specimen from U, U’ falls into the i-th state (Zp; ==p; =1). This rule mini- 


358 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


mizes the expected total cost of mistakes in classification. If ¢ =c’ =1, the rule reduces to that of minimizing the 
average frequency of misclassification. Note that an increase in the number of variates merely increases the number 
of multivariate states, introducing no further complication in the computations. 

This paper seeks answers to three questions that may arise in the use of the technique. 

(1) In order to set up a rule, estimates of the p;, p; must be obtained from samples of specimens known to be 
classified correctly. How large should the samples be? When the rule is based on such estimates the frequency of 
misclassification will be increased by errors of estimation of the PiDie The average increase is computed as a function 
of the p;, the Di and the sample sizes n taken from each universe. If pilot samples have been selected, this result is 
useful in estimating how much larger samples are needed to reduce this increase to a negligible amount. Tables are 
given showing the maximum increase in frequency of misclassification for n = 50, 100; m (no. of multivariate states) 
=2, 4, 8, 16 and various true probabilities of misclassification. 

(2) If a normally distributed variate is transformed, for simplicity, into a discrete variate with s states. how 
much discriminating power is lost and what is the best partition of the normal curve? Given a large number of in- 
dependent normal variates ill-treated in this manner, the asymptotic relative discriminating power is found to be 
0.64, 0.81, 0.88, 0.91 and 0.94, for s =2, 4, 4, 5, and 6 respectively. Calculations are in progress for a small number of 
variates, to discover whether the asymptotic results for the relative power and the bes¢ points of partition will hold 
in more realistic situations. 

(3) In some applications it is clear that the values of x, x’ =1—z will differ materially from 4. An initial esti- 
mate of x must be obtained from some source. After a number of specimens have been classified by the rule, it is 
advisable to re-estimate from these specimens, so as to verify whether the initially-chosen value still holds. The 
maximum likelihood estimate, given that the p,;, p;/ are reasonably well known, is simple to compute. The problem 
is the same as that of disentangling a mixture of samples.from two different multinomial populations. 


What Do Consumer Buying Plans Forecast? Morris Conen, National Industries Conference Board. 


The dynamic character of consumer outlays for durables and housing has indicated the need for strengthening 
traditional forecasting techniques. Consumer surveys have thus been developed as an independent tool. ° 

Since 1958, The National Industrial Conference Board, with the financial sponsorship of Newsweek, has been 
conducting a survey of consumer buying plans. It differs from earlier surveys in four particulars: First, it is con- 
ducted much more frequently—weekly rather than only a few times a year. Second, the emphasis is on buying plans 
of consumers rather than consumer psychology. Third, buying plans data relate to a wide variety of consumer expen- 
ditures, including not only most major appliances but also furniture, vacations, clothing and housing. Fourth, the 
buying plans horizon is limited to only six months. 

The survey indicates the direction and magnitude of prospective changes in specific consumer outlays. How- 
ever, in view of the short time the survey has been in operation, results thus far are preliminary in nature. The 
predictive record for home and automobile purchases has been quite encouraging. Less encouraging have been 
results for clothing and television sets. A longer period of data collection, a more intensive analysis and a re-interview 
program are required before the full usefulness of the survey can be ascertained with any degree of certainty. 


Estimating Public School Expenditures in the Decade Ahead. Louis H. Concer, Jr., U. S. Office of Education. 


In projecting future costs of the public schools, debt service expenditures should be treated separately from 
current operating expenditures because they depend on different factors and need not grow at parallel rates. For 
example, operating expenditures relate directly to enrollment, but debt service expenditures relate to the change in 
enrollment over an extended previous period, since school construction is the reason for most debt service. 

Future debt service will depend not only on obligations to be incurred in the future, but also on obligations 
incurred in the previous 15 or 20 years. The paper presents methods of dealing with each of these components, in 
the case of the public schools. 


The Identification, Development, and Utilization of Human Talents. Jonn T. Datmey, University of Pittsburgh and 
American Institute for Research. 


In 1960 the University of Pittsburgh, assisted by the American Institute for Research, will carry out an inven- 
tory of the quality and potential of our youth. It is expected that this will, for the first time, give accurate estimates 
of the proportion of American youth with valuable talents for becoming successful engineers, teachers, carpenters, 
machinists, writers, artists, and so forth. 

Educational and psychological tests will be given to a 5 per cent sample of high school students across the 
country, who will then be followed up for extensive study. 

The tests and procedures to be used in the study have been especially developed and will sample each student’s 
aptitudes and achievements in a comprehensive way. The tests will include the types of aptitude classification tests 
developed in the military services, measures of educational achievement in mathematics, English, and other areas, 
and measures of interest and personality. When these test scores are combined with information about the back- 
ground of the student, the community, and the school, the results will provide an accurate inventory of the abilities 
and potentialities of the youth of this country. 

Extensive statistical studies will analyze the interrelationships between test, school, and follow-up data. 


The Computer in the Analysis of Social Statistics. Josera F. Daty, Bureau of the Census. 


With few exceptions social scientists still regard electronic computers as highly specialized desk calculators, 
designed to perform laborious mathematical manipulations at fantastic speed and at equally fantastic cost. 

Modern electronic computer systems are not, however, limited to doing such routine numerical work. They are 
capable of being programmed to carry out all the complex steps of checking, classifying, rearranging and summariz- 
ing socioeconomic data more cheaply and more accurately than clerks can do it. They are capable of multiplying the 
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individual social scientists’s data-processing ability in the same way that electric power has multiplied his available 
physical energy. 

But this capability will not be widely exploited until social scientists unite in clamoring for the development of a 
computer programming language which suits their needs. In the field of mathematics it is already possible for a man 
who knows how to solve a problem to communicate his method of solution to the machine almost as readily as he 
could communicate it toa group of clerks. In the much more complex field of the analysis of social data the job of 
making the computer available to the man with the problem will be more difficult, but will be correspondingly more 
rewarding. 


Locating Outliers in Factorial Experiments. Curnpert Daniei, New York City. 


Factorial experiments are more d. d by undetected bad values than by missing values. The patterns of 
disturbance produced by one and by two bad values are given for two-way layouts and for 2” experiments. Each 
pattern can be summarized by a linear contrast among the observations. An adaptation of Scheffe’s method for 
judging all contrasts in the analysis of variance is used to make the decision on rejection or non-rejection of sus- 
pected bad values. 


Estimating Shelf Life of Bacterial Populations. Ina A. DeArmon, Jr., U. S. Army Chemical Corps. 


The shelf life of a bacterial population is defined as that time, t, when some percentage, p, of the original popu- 
lation remains viable after storage at some designated temperature, 7. Storage at high temperatures for short time 
periods can be used to determine specifically the changes in the bacterial population. 

The probability of death as a function of both time and temperature of storage is one model that has been 
used to write a multiple regression equation that describes the changes in any given bacterial population. From the 
stated model the interpolated and extrapolated estimations of the bacterial »opulation shelf life can be obtained 
for any designated temperature. 

Controlled storage conditions that resulted ir rapid decrease in viable cells have been used with four different 
bacterial populations (wet and dry cells of both a vegetative and spore population) to predict their shelf life. The 
predicted shelf life based on the accelerated st diti is pared to the actual observed shelf life in the 
vegetative bacterial populations. The shelf life predictions for the spore bacterial populations have been made but 
sufficient time has not elapsed to confirm these estimations. 


Uses of Census Information on Real Estate Values for Economic Analysis. Ropert F. Drury, Bureau of the Census. 


The 1957 Census of Governments broke new ground in the field of source materials for economic analysis. 
The quantities of information made available for the first time as to the composition and taxable values of the 
Nation’s stock of privately-owned real estate and as to the relation of such assigned values to current price levels 
as determined from bona fide real estate sales are worth the scrutiny of economists as well as political scientists 
and others concerned with Government and its administration. 

The new Census data provide much needed tools for wealth estimation, for study of the incidence of property 
taxes on major economic sectors, for evaluation of property tax administration and local Government fiscal capacity, 
for study of the volume and characteristics of changes in property ownership, and for many other related researches. 

In addition to the direct usefulness of the data results of the Census project, the statistical techniques—sam- 
pling, exploitation of source records, survey methods, statistical weighting and estimation procedures, and the like— 
developed and applied by the Bureau of the Census for this work afford a methodological blue print for Federal 
agencies, States, local communities, and private researchers who wish to conduct similar surveys to up-date the 
Census benchmarks or explore in detail facets of the material to which Census work has given only passing attention. 


A Statistical and Analytical Technique for Regional Analysis. Epcar 8. Dunn, Jr., University of Florida. 


The purpose of this paper is to present a statistical and an analytical technique for understanding the regional 
development of a national economy. This technique permits the identification of the results of factors (a) operating 
more or less uniformly nationally (even though having a different impact upon different regions), and (b) operating 
more or less specifically in a particular region. It permits comparison of growth in a specific sector of the economy 
in different regions. It is conceptually simple and does not make impossible demands upon the data. 

The technique is based upon computing geographical shifts in sectored layers of economic activity. These 
shifts are used as building blocks to identify (1) a “differential effect” (growth arising out of a differential gain vis-A- 
vis other regions in access to specific markets and /or inputs for one or more specific activities), and (2) a “propor 
tionality” effect (growth arising out of specialization in the nation’s rapid-growth industries). 

The statistical technique is designed to identify and measure these two kinds of forces and, therefore, provide 
a better understanding of what has happened in the past and what might happen in the future. 


The Enumerator Rings the Bell. Lowrett T. Gait anp Rosert B. Votcut, Bureau of the Census. 


In a large fast-moving statistical operation such as the 18th Decennial Census a significant amount of resources 
goes into recruitment and indoctrination of enumerators and providing them with necessary logistic support. 

When the enumerator rings the bell in April 1960 it’s likely to be a woman, mainly by reason of economic avail- 
ability. She will have survived a screening test of ability to understand detailed and exacting instruction and to 
read maps. She will have had the benefit of a varied training program with audio-visual presentation and participa- 
tion through class discussion and practice exercises. In addition to concept definition and technique of marking 
FOSDIC schedules, the enumerator will be instructed in the practical skills of organizing her work, managing call- 
backs, handling difficult respondents and otherwise making her time productive. 

As she leaves her training class the enumerator will be equipped with a portfolio designed to provide her with 
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all d t ation aids, mapping and reference materials which she needs for taking a complete and ac- 
curate census of her country’s population resources and an inventory of its housing facilities. 


Wage Escalation and Wage Inflation. Josern W. Garsanrino, Cornell University. 


This research project is designed to evaluate wage escalation as a system of wage determination. Wage escala- 
tion is defined as the system of adjusting wages during the lifetime of a multi-year collective bargaining contract so 
as to compensate periodically for changes in prices and, in addition, to provide for scheduled increases in real wages 
at regular intervals. 

The analysis starts from the assumption that the industry participants in these power struggles have sought to 
institutionalize these conflicts so as to minimize their impact on internal operating relationships and their political 
“visibility.” They have attempted to develop “principles” that would serve as the framework for an agreed-on 
wage policy and to space the negotiations that test the application of these principles over the longest possible time 
span. The GM-UAW “wage formula” in their 1948 and subsequent contracts is undoubtedly the most significant 
and widely imitated innovation. 

This study involves (1) reviewing the record of wage escalation; (2) investigating the motivations of the parties; 
(2) assessing the economic validity of the reasoning behind wage escalation; and (4) attempting to evaluate the 
compatability of wage escalation with the broader objectives of public policy. 

It must be kept in mind that the most likely alternative to wage escalation is frequent open-ended collective 
bargaining rather than some idealized wage setting system. Many criticisms of wage escalation may turn out to be 
criticisms of collective bargaining as a method of wage setting. 


Ties in Paired-Comparison Experiments. W. A. GLENN anv H. A. Davin, Virginia Polytechnic Institute. 


In making paired comparisons a judge frequently is unable to express a real preference in a number of the pairs 
he judges. Nevertheless, some of the methods in current use do not permit the judge to declare a tie. In other cases 
ties are permitted, but ignored in performing the analysis. Alternatively, the ties are sometimes divided, equally or 
randomly, between the tied members of a pair. 

In the Thurstone-M osteller method it is in effect assumed that all stimulus response differences, however small, 
are perceptible to the judge. In the present paper the assumption is made that a tie will occur whenever the difference 
between the judge's responses to the two stimuli lies below a certain threshold. This threshold and the mean stimu- 
lus responses are estimated by least squares. 

An angular response law is postulated for the response-scale differences. In the resulting transformed data non- 
homogeneity of variances is encountered. A weighted solution is set up, the weights being determined from a pre- 
liminary unweighted analysis. A test of the validity of the model is described. A computational procedure is out- 
lined, and a numerical example given. 


Internationa! Measurement and Analysis of Savings. Raymonp W. Gotpsmiru, National Bureau of Economic 
Research. 


The paper will discuss the advances made in the United States during the postwar period in the measurement 
of personal and national saving, both by the aggregative and the sample methods, and in the analysis of these data. 
On that basis some suggestions for further improv in statistics of saving will be offered. 


+ 


Over and Under Purchasing of Housing. Manvet Gorruies, University of Wisconsin. 


1. In their pure form housing cycles should be regarded as local cycles in a given housing market arising out of 
a recurrent tendency toward over and under purchasing of housing. This tendency is activated in terms of two well- 
known theoretical principles: Cobweb mechanism (lagged adjustment of supply to demand) arising out of a variety 
of circumstances (long gestation period of housing, elasticity of air use and temporary supply adaptation). The 
second principle—the acceleration mech converts teady growth of demand for shelter into cyclical fluc- 
tuations for the means of production of that service (new construction). Local housing cycles will run a wide range 
of durations and amplitudes and contour profiles due to variations in local housing markets, rates of urban growth, 
type of building involved, and quality of expectation at all levels. 

2. Local cycles cannot simply be aggregated or averaged without taking into account two dampening mecha- 
nisms which emerge on the aggregative but not the local level: (a) restrictions arising out of limits of any of three 
basic requisites for residential building (loanable funds, skilled labor and building materials), (b) tendency for local 
cycles to offset each other. The latter tendency can be neutralized by powerful external disturbances. The former 
restriction is released as building requisites may be drawn away by bidding or other means from other sectors of 
construction or fund-users. Thus overall building is characterised by lower amplitudes than residential construction; 
while total construction has still smaller amplitudes. 


Stock Prices and Capital Evaluation. ALAN Greenspan, Townsend, Greenspan and Co. 


This paper attempts to tie the evaluation of stock prices to certain parameters of an economic system: time 
preference, capital productivity, liquidity preference, the savings rate, and the market rate of interest. 

It discusses the significance of certain stock market measures, namely, the stock yield-interest rate spread, 
price-earnings ratios, and the level of risk premiums. It also evaluates the so-called “inflation motives” as a stock 
price determinant. Finally, the paper points out the role which security trading plays in evaluating all forms of 
income-producing assets; the impact on debt generation of changes in the market evaluation of stocks; and the 
relationship between the money supply and stock prices. 

In general, the paper analyzes the financial! underpinnings of a apeculative boom. It goes into certain character- 
istics of speculative markets: the strong stimulus to industrial production stemming from rising stock prices; the 
pressure on the money market as rising market values proliferate all forms of borrowings; and the tendency for 
short-term interest rates to move above long-term rates during speculative periods. 
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In conclusion, the paper develops the role of short-term interest rates in snuffing out speculative bull stock 
markets. In this connection, it reviews the concept of a “New Era,” the historical parallel of the 1950’s and 1920's 
and the role of the Federal Reserve in stock price fluctuations. 


Truncatiou and Tests of Hypothesis, Il. Irwin Gurrman, McGill University. 


This paper continues the work begun by Aggarwal and Guttman, Ann. Math. Stat., Vol. 30 (1959), pp. 230-8, 
and examines the loss of power when using tests based on the assumption that the variable being sampled has a 
“complete” normal distribution, when in fact sampling is from a symmetrically truncated distribution. The hypothe- 
sis considered here is the one-sided test for the variance of a normal distribution. 

Some tables have been computed, and they show: (1) that appreciable losses in size of the test occur; (2) that 
some loss occurs in the power, but this decreases with the alternative value of the variance and the degree of trunca- 
tion and (3) that the gain in power when using correct procedures decreases in the same way. Correct significance 
points for “correct procedures” are also given. 


On the Measurement and Control of Measurement Errors in Censuses and Surveys. Morris H. Hansen, WILLIAM 
N. Hurwirz anp Max A. Bersnap, Bureau of the Census. 


The purpose of the paper is to present some theory and to illustrate the applicability of the theory to some 
practical probl in the t and control of resp and pr ing errors in censuses and sample surveys. 
If the response errors are uncorrelated, the paper demonstrates that response variability will not be an important 
factor in many practical problems in either censuses or surveys, and the usual estimates of sampling errors in sample 
surveys properly reflect the response variability. If the errors are correlated, as is the usual case, the problem is 
more difficult. Some methods for estimating certain important aspects of correlated response errors are given, 
and some results of experiments conducted during the 1950 Census of Population are presented along with the 
implications of the results on methods of taking censues and sample surveys. 


The Evaluation and Research Program of the 1960 Censuses. Morris H. Hansen, Leon Pritzker snp JOSEPH 
Sremsera, Bureau of the Census. 


The purposes of the Evaluation and Research Program are in broad terms: 

1. The identification of the sources of error in census results, the development of improved of 
ment and control, and the evaluation of the accuracy and cost of alternative census methods and of alterna- 
tive methods of evaluation. 

. The measurements of both the variance and bias—i.e., the mean square errors—of statistics of the 1960 
Censuses. These should guide users in the appropriate application of Census statistics and should help guide 
future census plans in adapting programs to the needs of users. 

We are devoting approximately one and one-half per cent of the Decennial Census appropriations to research 
and evaluation. In the current stage of planning of the Evaluation and Research Program, the following seven proj- ° 
ects have been established: 
Project A Measurement of Nonsampling Variability 
Project B Measurement of Coverage Error: Reverse Record Checks of General Sample of the Population of 
the United States 
Project C Measurement of Coverage Error: Reverse Record Checks of Special Samples of the Population of 
the United States 
Project D Measurement of Coverage Error: Re-enumerative Studies 
Project E Measurement of Content Error 
Project F Studies of Processing Error 
Project G Analytical Studies 
Project H Other Coverage Improvement Studies 
These projects are described in the paper. 


A Modified Gauss-Newton Method for the Solution of Non-linear Squares Problems. H. 0. Hartiry, Iowa State 
Universii,. 


thod 


Consider the following non-linear least squares problem: Given n sets of “obse-\ed” k+1 tuples ya; mA, 
ah, * * * , eaA(h=1, 2, * , n) anda non-linear regression function depending on m parameters 
m) f(x; %) =f(ai + + + ze; 1, * * * , Om) required to determine the parameter vector @ for which the sum of squares 


= — (xn; 9))? = minimum. 


Two methods of solving this problem are fairly well known. 

(a) The method of steepest descent applied to Q(4). 

(b) The Gauss-Newton method of linearizing f and determining (iteratively) corrections to @ from linear least 

squares equations. 

Whilst (a) is known to converge under wide conditions its rate of convergence is found often to be painfully slow. 
On the other hand (b) may reach the solution at a much faster rate but its convergence is not d. The p t 
method, which may be regarded as a modification of (b) has the advantageous features of both (a) and (b) and is 
free from their disadvantages. 


The Application of Ratio Estimators to Anaiytic Studies. H. O. Hartiry, Iowa State University. 


The data from a sample survey are often used for what are called “analytic studies.” These usually involve 
comparisons of means and totals for the units in certain subgroups of the population sampled which are called 
“Domains of Study” by the U. N. Subcommission on Sampling. The problem is therefore the sample survey ana- 
logue to the comparison of “treatment means” in the statistical treatment of experimental data. 
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The paper first provides simple estimates i? of the group totals, i¥ for a measured characteristic, y for the units 
of the j-th group by attaching to all units in the finite population the ch teristic 


fr if unit is in j-th group 
of 0 otherwise 


and substituting these in the appropriate estimates of the population total. Variance formulas and estimates of 
variance formulas then follow automatically by applying standard formulas of survey methods to the characteristics 
jy. Estimates of group means if are finally obtained as ratio estimators ;? /;{ by dividing the iY by the analogous 
estimate ;U of the population size in the j-th group. The great wealth of “ratio estimation” is thereby made available 
for the estimation of group means, differences of group means and their variances and approximate confidence 
intervals. Unbiased ratio estimators are of interest in certain situations. 


Sampling with Unequal Probabilities without Replacement. H. O. Hartiey anv J. N. K. Rao, Iowa State University. 


In sample surveys we are often concerned with drawing a sample of n units from a population of N units 
(i=1, 2, + + +, N) to which are attached known “measures of size” z;. These “sizes” 2; are often linearly related to 
the characteristics yj; whose population total Y is to be estimated. One of the pling techniq available for 
utilizing the z; in such a situation is to draw n units with “probabilities proportional to the size zj.” When the samp- 
ling fraction n/N is not too small it is further desirable that sampling be “without replacement” as this is expected 
to result in an increase in the precision of the estimate of Y. A sampling technique achieving the above objectives is 
here offered which has the following properties: 

(i) An unbiased estimate ¥ of ¥ can be computed. 
(ii) The variance of ¥ depends only on the variation of the ratios y;/z; and has the “finite population correc- 
tion” attached to increase precision of 

(iii) The variance of under (ii) can usually be estimated by a positive estimator. 


Pianning and Financing of Social Research: The University as a Force. Pamir M. Hauser, University of Chicago. 


The University, while it is still the major instrumentality in social research, is becoming a less, rather than a 
more, important force in determining its general direction and in the planning of specific projects, Social science 
research is becoming increasingly directed by non-university agencies—government, foundations, and business. 
Social science in becoming subservient to sources of financing is directing its activities more to problems, the amelior- 
ation of which concern the sources of funds, than with the basic advancement of social science. Social research does 
not get adequate support from the University itself, either because the University still adheres to 19th century 
conceptions of what social science is and how social research is ducted; or b of impoverishment it is unable 
to provide the necessary finances. 

Present trends constitute a sharp reversal of patterns of planning and financing social research a generation 
ago. In the interest of advancing social science, present ways of planning and financing research merit the careful 
consideration of social scientists, University administrators and the various agencies providing research resources. 


Regional Accounts: Objectives and Framework. Werner Z. Hirscn, Washington University and Resources for the 
Future. 


Regional Accounts, if properly designed, can aid in three main decision areas— 

1. Local public decisions about public services, expenditures, and taxes; traffic and transit; land use and other 
resources planning and regulation. 

2. Plant location and marketing decisions by business and industry. 

3. Federal government activities, which either are designed to create an appropriate environment conducive 
to the making of local public and private decisions consistent with certain stipulated national objectives, or can 
generate major local effects. 

A framework is developed which helps trace and quantify effects of specific local public or private policies, or 
of exogenous forces on the five main dimensions of a “region’s health and wellbeing”—economic growth, economic 
welfare, basic employment stability, net social benefits and amenities of life. 

The regional interaction model has three main elements—an “asset” analysis, a “flow” analysis, and an “impact” 
analysis, and it proceeds in four consecutive stages with numerous feedbacks. In the model the local economy is 
conveniently separated into private and public sector. Assets and liabilities of the private and public sectors consti- 
tute the environment that faces private decision makers and conditions their action. 


Monthly Change Indices and Other New Measures of Business Fluctuations. Jonny H. Hoacianp, Michigan State 
University. 


Based on monthly reports of purchasi tives in more than 1,000 firms, important new business indicators 
have been developed for such significant items as new orders; order backlogs; industrial production; employment; 
prices; lead times; profits; raw material inventories; and raw material inventories of steel, copper, aluminum, elec- 
trical P hanical ts, and cutting tools. Data for these indices, called Monthly Change In- 
dices, have been developed into tne series and analyzed for seasonal, cycical and other fluctuations by Census 
Method II. Some of these time series cover the period of time since 1935, and many provide new and excellent meas- 
ures of business activity during the past ten years. 

In addition to providing new measures of business trends, these Monthly Change Indices have the advantages 
of being available sooner than government indices, performing more smoothly and more reliably than most other 
indices, warning of changes in b trends ths ahead of time, and indicating trends in regional areas as well 
as national trends. 

Other new measures of business activity in such important areas as monthly indices of capital equipment and 
purchased parts will also be presented in this paper. 


4 
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To illustrate the timeliness and usefulness of these new indices, this paper will include a discussion of business 
trends during 1959, including the month of December, 1959. 


Linear Multivariate Models as Representative of Clinical Judgment. Paut J. Horrman, University of Oregon. 


Decisions were obtained from 29 clinical psychologists and trainees as judges concerning the disposition of 
1,000 mental patients. The clinicians had available only quantitative information with respect to 11 patient char- 
acteristics (predictor variables). This information was presented in a controlled fashion and on prepared profile 
sheets. A linear multivariate model was developed for each judge and the relative contribution of each of the pre- 
dictors to the judgment or decision was thereby established. The adequacy of the linear multivariate model in 
accounting for the variations of judgments was likewise established for each clinician. 

For many of the clinicians serving in this study, linear models appear quite satisfacatory in representing the 
judgment processes and in fact, in many cases, the process may be explained on the basis of a simple additive combi- 
nation of a relatively small amount of the total available information. On the other hand, there were some judges for 
whom optimal configural validity is significantly larger than that approximated by the linear model. Although there 
are widespread individual differences in the extent to which clinicians deal with information in configural ways an 
adequate description of the judgment process can be accomplished with much simpler models than has been gener- 


ally supposed. 
Some Experiences in the Practice of Medical Statistics. Ropert G. Horrman, University cf Florida. 


Without success so far, the author has attempted to interest the American Statistical A iat in 
ing a section for Medical Statistics. Medical education, practice and research all have areas in which the statistician 
can contribute. The importance of medical problems is illustrated with examples from (1) Normal values in medicine; 
(2) Problems and significance of clinical laboratory tests; and (3) Current clinical problems. 

The first example deals with the need for criteria and additional! data in establishing normal values in medicine. 
The second example presents data from a hospital where the blood usage dropped a thousand pints per year with 
the correction of a consistent laboratory error. Statisticians can contribute to these problems. The third example 
deals with the establishing of statistical service systems to hospitals. Through these systems it is possible to keep 
track of many current medical problems as well as some of the fi ial aspects of hospitalizati 
of this kind is lacking in medicine today. We do not even know how many male and female patients are hospitalized 
in this country annually much less other details. The most important feature of the systems is that it makes possible 
the study of current clinical problems by practicing physicians. Potential uses of statistics in medicine have hardly 
been started. 

Statisticians working in all areas of n.. Jicine should have a section where they can meet and discuss common 
problems. 


Programming the Population Census. Ricuarp A. Horwsers, Bureau of the Census. 


“Programming,” as interpreted and applied in processing the 1960 Census, includes far more than tabulation of 
census returns. The computers will perform as part of a system planned so that it becomes theoretically possible to 
feed in, for example, the original 100 per cent returns for a State and have come out one month later the print copy 
for the entire 100 per cent publications with an absolute minimum of human intervention. In order to help achieve 
this goal, the computers are expected to undertake as much as possible of the editing, evaluation, correction, and 
control of returns; the detection of and adjust t for system malfunction; the preparation of final print copy; and 
the combination of several operations into one, such as by processing population and housing data together. 

In addition to these functions, the computers are being used to prepare ED listings for use in control of field 
and clerical operations; to help prepare the detailed geographic codes for each ED to permit summarization of results 
by ED'’s to all the geographic areas and classes shown in publication; for the ple, to prepare or verify as much as 
possible the elaborate codes required for tabulation of detailed characteristics; and to determine and apply weights 
for the sample persons and housing units. 


+i 
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Sequeutial Chi-Square and T? Tests. J. E. Jackson, Eastman Kodak Company and R. A. Brap.ey, Florida State 

University. 

Sequential procedures are developed to the composite hypothesis: 

Ho: — wo) — wo)’ = 
against the alternative 
Aa: — yo) — wo)’ = 43 

where ye is a vector of constants and \3 >aj. The sequential chi-square test is obtained when = is known and the 
sequential 72-test is cbtained when & is unknown and must be estimated from the sample. Both of these tests are 
extended to the two sample case. 

It is shown that the conditions of Cox's Theorem, guaranteeing the existence of these tests, are fulfilled. Con- 
jectural expressions are given for the ASN function under H» and H:. The chi-square tests are generalized in the 


manner of Hotelling s generalized 7* statistics. The preparation of tables to facilitate thees tests is discussed and an 
industrial application related to ballistic missile testing is included. 


The Consumer Price Index—Technical Questions and Practical Answers. Stpney A. Jarre, U. S. Department of 
Labor. 


Among questions persistently asked about the Consumer Price Index is what distinguishes it from a cost-of- 
living index. In contrast to the CPI which holds purchases constant, a cost-of-living index should measure price 
ge holdi tisfaction or utility constant. Unfortunately, there is no operational approach to a cost-of-living 
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index in either this sense or in terms of the price component of a change in expenditures of different composition. 
Under normal circumstances, however, the CPI can be considered a good approximation to the changes in the cost- 
of-living. 

In our dynamic economy products change too rapidly for the comfort of index makers, leading to questions 
about the effect of quality change upon the index. It is an oversimplification to lude that b quality is up 
the index is biased upward. This ignores the CPI mechanism. Quality can and is factored out of some price com- 
parisons. In some cases, e.g., the new automobile, direct comparison of old and new prices lead to upward biases. 
In other cases, linking procedures tend to factor out not only quality changes but also real price changes as well, 
causing downward biases. 


The Adequacy of Current Textbooks for the Elementary Course in Statistics. Evererr H. Jounson, George Washing- 
ton University 


An appraisal of adequacy of a text in statistics consists of both subjective and objective considerations. The 
extent to which adequacy is achieved depends on needs and abilities of the students, experience and desires of the 
instructor, and judgment and opinions of the appraiser. C quently, it is not to be expected that any text could 
be universally adequate. 

Too many texts have been written ostensibly to present certain of the more common statistical techniques 
together with their theoretical connotations, and without sufficient attention to the proper role of statistics in the 
field of application. Texts of the applied type would be more useful if statistical theory were subordinated to the 
subject field and statistical procedures pr ted and di din their proper role—namely, tools of analysis supple- 
mental to the non-statistical ones of a particular field. To achieve such a goal an author should be not only a com- 
petent statistician, but also be thoroughly familiar with the field of application, and should plan his manuscript 
to accomplish a specific aim rather than to compile an encyclopedia of statistical procedures. 


Some Interrelationships Between Buying Intentions, Purchases and Expectations. F. THomas Jusrer, National 
Bureau of Economic Research. 


Alternative hypotheses about the relationship between the fulfillment of buying intentions and the household's 
expectati about i are tested. We find that for all households there is a strong negative relationship between 
the ratio of purchases to plans and income expectations, holding experienced income change constant; that is, 
h holds with optimistic expectations tend to have low ratios of purchases to plans, and vice versa. This finding 
is consistent with the hypothesis that purchases are related to intentions and “income surprise” (the contingent 
action hypothesis) but is probably inconsistent with the hypothesis that favorable expectations result in more pur- 
chases, relative to intentions, because of the bousehold presents a more consistently optimistic response to a battery 
of questions that measure and reflect attitudes (the additivity hypothesis). We also find, however, that this aggregate 
behavior is not characteristic of group of households that report zero buying intentions; here, favorable expectations 
are likely to result in more rather than less purchases. This might be due to income expectations being a proxy vari- 
able for intentions, indicating how close sero intenders are to the threshold of being intenders. Because of the dichot- 
omy in behavior between intenders and non intenders with respect to the effect of income expectations on pur- 
chases, it is argued that interaction variables are essential in multivariable analysis of purchases whenever buying 
intentions are one of the independent variables. This dichotomous behavior of intenders and non intenders may be 
important not only for expectation variables, but also for variables reflecting the fi ial and demographic situa- 
tion of the household, i.e., income, assets, debts, ages, marital status, stock of durables, and so forth. 


Population and Welfare in South Asia. NarHan Keyrirz, Uriversity of Toronto. 


The fall in the Asian death rate is taking place over a period of twenty years rather than over one hundred 
years as in the West. The impact of medicine and of DDT is too sudden for familial and other institutions; the pres- 
sure for large families so essential for survival through all previous history is not easily reversed. In the West the 
hundred years necessary for their reversal provided manpower for machine industry and a comfortable expansion 
into the spaces of the New World. But in a hundred years Asia’s population would multiply fivefold from already 
high levels. We know that institutions and practices are more quickly changed by a wealthy and educated popula- 
tion than by a poor and ignorant one. The case of Japan shows how’rapidly births can fall when rising income 
and public communication have brought a rational approach. Will Asia’s crowding prevent the increase in income 
and rationality that would solve her population problem? The danger of this is not only that the additional people 
have to be equipped up to the level of the old before there is capital for raising income per head, but more profound 
and pervasive: within the rural institutions of South Asia population growth operates to depress the technological 
level of agriculture, and to lessen steadily the number of workers that can be supported by industry. 


Some Sampling Techniques for Continuing Survey Operations. Lestie Kisa anp Irene Hess, Survey Research 
Center, Univ. of Michigan. 


In planning the repeated use of a sampling frame the sampler has opportunities for improvements over single 
designs. (a) A larger sample is first prepared and this serves (for one to three years) as the frame from which samples 
are drawn as required. Efficiency is increased through better tb ies. of size and stratification 
(b) Double sampling (of dwellings, blocks, tracts, etc.) can be introduced, spreading the cost over many samples. 
(c) Better knowledge of response and coverage rates yields closer control over sample size. (d) The effects of non- 
response can be reduced with a “replacement” procedure. (e) Cost and variance factors can be cumulated and 
averaged. 

We describe also a modification of Keyfitz’s (J ASA, March, 1951) scheme for changing the probabilities of 
selection of sampling units, 
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Analysis of Survival Data by Regression Techniques. S. A. Kranr anv R. J. Buesier, Jowa State University. 


The nature of the process of retirement of groups of industrial properties is so complex that it is difficult to 
postulate adequate mathematical models such as those employed in life-testing, etc. Assuming that there exists 
a survivor function, M(t), representing the proportion of a group remaining in service at time ¢, such function is 
given by exp (—v(t), where v(t) is the time integral of the retirement rate, r(t) = —d(log M(#)) /dt. Rather than to 
hazard a guess as to the parametric form of any of these functions, it is the intent of this paper to approximate the 
integral, v(t), by a polynomial, whereupon M(t) may be graduated by the descending exponential function. For 
large samples it is found that the covariance structure for the polynomial regression of v(t) on t may be obtained 
from the multinomial distribution when the data are grouped. Thus the method of weighted least squares may 
be employed in fitting v(t) “Censored” data in no way vitiate the method. 


The Components of Change Program in Relation to the 1960 Census of Housing. Franx 8S. Krisror, Bureau of 
Census. 


This paper examines the rapid development of housing statistics in the last 20 years, including the 1940 and 
1950 C s of Housing. The development of a need for statistical tools to measure the dynamics of the housing 
inventory-is discussed in terms of questions not answered by Census data. These include: “What is the rate of new 
construction, of changes such as mergers and conversions, of additions to the housing supply from nonresidential 
space? What is the rate of demolition and other withdrawals from the housing inventory? How does the widespread 
mobility of the American population affect housing demand?” 

The development of the Components of Change program in the 1956 National Housing Inventory to answer 
some of these questions is described. A description of the scope, content and techniques of the 1959 Components of 
Change program is provided. The unique opportunities for analysis of the housing inventory provided by this pro- 
gram in conjunction with the 1960 Census is described, along with a description of the steps taken to provide a link 
between the Housing Census and the 1959 Components of Change program. 


Government as a Source of Anticipatory Data. Srantey Lesercorrt, U. S. Bureau of the Budget. 


Federal soothsaying in the Agricultural Outlook Conferences and other forums has a well-known history and 
wide acceptance. There has been far less agreement about the Federal collection of anticipatory data that analysts 
in business, labor or government could use to make their own projections. The paper reviews some present and po- 
tential Federal progr for collecting such data. Among the measures considered are anticipatory data on: con- 
struction starts and activity; manufacturers inventory change; consumer spending for durables; forthcoming expen- 
ditures by state, local and Federal Government. A program to measure new orders by retailers is also outlined, as 
helpful in suggesting future d d at the wholesale and manufacturing levels. Finally a measure of changes in 
factory earnings is suggested as helping to point out developing wage patterns and, in some degree, subsequent price 
changes. Without specifying who should conduct each of these programs the paper does assume a Federal interest 
in having such information. For with such data at hand economic policy officials can better choose those policy 
actions that are most appropriate in the light of the economic trends likely to develop in the absence of specific 
Federal action. 


Quantitative Methods in Sociology. Georcr A. LunpBera, University of Washington. 


Quantitative methods in sociology have been under serious development chiefly during the four decades 1920— 
1960, which correspond roughly to the professional career of William F. Ogburn. The principal issues regarding 
quantitative methods which engaged sociologists during that period were: (1) philosophical controversy arising 
from the assumption (a) that some phenomena are intrinsically and inherently quantitative (e.g., such as most of 
the subject matter of demography), while (b) other phenomena are intrinsically qualitative. The contrary view re- 
garded quantitative and qualitative aspects of phenomena not as a characteristic of phenomena but purely a matter 
of what method of observation and ding is adopted. Another issue was, (2) the controversy over the relative 
merits of the “case study” versus the statistical method, especially as regards objectivity, prediction, and other 
purely scientific objectives, as contrasted with diagnostic and therapeutic objectives. Perhaps the principal problem 
of quantification was reflected in (3) the controversy over attitude scales and techniques of studying verbal behavior 
and public opinion, and the relationship bark such data to other types of social behavior. All of these developments 
greatly stimulated interest in li hniq mathematical models, information theory, and machine tech- 
niques, of processing data. The implications of these developments for sociology as a sci are impressive. 


Purchasing Waves. Ruts P. Mack, City of New York. 


The paper explores the characteristics of fluctuations in materials purchasing. Following the lead found in 
studies of several individual industries, it examines whether the purchasing of “raw” materials in many industries 
undergoes waves of shorter duration than those ly d as busi cycles. 

Over and under purchasing occurs to the extent that there i is rf difference between the amount of basic materials 
currently bought by business firms and the amount incorporated in finished goods currently sold. This central 
phenomenon, plus the rounds of cause and effect with which it is associated, constitute a purchasing wave. 

The best measure of over and under purchasing is the rate of change in ownership of materials—changes in 
purchased materials stocks on hand and on order. Evidence of the purchasing wave as a whole appears in changes 
in sensitive prices, production, income and the like that are associated with the shifts in ownership position. 

The indicated time series are examined. They exhibit waves averaging some 18 months in duration which con- 
sist of two sorts of movements—(1) those associated with but leading the shorter phases of business cycles, (2) 
extra movements which interrupted many cyclical banks. The extra movements appear at about the same time in 
both durable and non-durable goods industries. 
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The implications of these findings are summarized in terms of useful statistical records, the competitive scene 
and the study of business fluctuation. 


Disease Severity Quantitation. CLirrorp J. MaLoney, Fort Detrick. 


The fact of death in quantitative evaluation of virulence of bacterial or viral agents has a simple unambiguous 
interpretation. Quantitative studies on incapacitation are provided with no such unique criterion. Temperature rise, 
malaise, antibody titre, gross and microscopic lesions may all be employed, raising questions as to the reality and 
dimensionality of a scale of illness. 

The problem appears related to the problem of scaling, extensively studied in psychometrics. Applicability of 
certain of these techniques to multiple measurements in a morbidity experiment is under study. 


Multivariate Chebyshev Type Inequalities. ALserr W. MarsHa.., Stanford University. 


Some procedures useful in deriving multivariate Chebyshev «ype inequalities are given, and the problem of 
proving sharpness is discussed. The ideas involved all stem from an analysis of the following standard proof: Let 
X be a k-dimensional random vector and let T be a subset of Euclidean k-space R*. If f is a non-negative function on 
such that f(z) >1 for all XC 7, then 


P(x Ti < dP 
ix € Er < Ef(z) 


A number of known inequalities are reviewed, with emphasis on recent results rather than completeness. 


The Problems of Questions such as Religion and Origins in the Canadian Census. J. T. MarsHati, Dominion 
Bureau of Statistics, 


Questions pertaining to such personal factors as origin and religion were provided for in an Act passed by the 
Canadian Parliament in 1870. Such questions have appeared in the Censuses over since. Problems arising from 
these questions are not primarily those associated with mechanics of tabulation but rather, are connected with the 
field work concerned with securing accurate, useful information. 

Contrary to what might be expected, resentment and antagonism on the part of the Canadian population, is 
very slight and the problems are largely iated with explaining or finding ways of explaining simply and clearly 
what it is that the census taker is trying to secure. Some questions are likewise conditioned by the fact that the 
respondent is not in a position to provide precise answers. Of course, questions on religion and origin may evoke 
a certain t of tional resp on the part of the people which in some circumstances results in erroneous 
replies which may impair the relative accuracy and usefulness of the statistics. 

Despite the shortcomings of the statistical material on religion and origin, they have been useful to numerous 
individuals and organizations. It is confidently believed that by thought and trial, particular questi or bi 
tions of questions are being developed that will produce a useable body of census data in 1961. 


Recent Changes in Programs for Labor Force Measurement. Marcaret E. Martin, Bureau of the Budget. 


The new arrangements which shifted responsibility for the planning, analysis and publication of monthly labor 
force statistics to the Department of Labor are described. Changes in the labor force statistics program as a result 
of the shift have been minimal, although some changes in emphasis in the future may be anticipated. 

Concepts underlying labor force measurement, payroll employment series and reports of unemployment in- 
surance activities were reviewed in 1955 by an interagency ittee which obtained advice and suggestions from 
a wide number of users, both public and private. The Interim Report of this Committee contains a number of recom- 

dati for changes and impro ts in the series. 

Actions since taken by Federal agencies to implement certain recommendations of the Review of Concepts Com- 
mittee are briefly noted. Some of these improvements have already been incorporated in the published statistics, 
some of them are still underway. 

Other Committee proposals for changes in concept, improvement or expansion of the series, or additional re- 
search are ized, indicating some of the reasons no action has been taken so far on these recommendations. 

Finally, a few proposals are singled out as most important for future development in programs of labor force 
measurement. 


An Experimental Study of Several Factors Affecting Rate and Nature of Response to a Mell Survey of Beginning 
Teachers. Warp S. Mason anp Rosert J. Dressex, U. S. Office of Education anp R K. Barn, Purdue 
University. 


In a pretest of a study of beginning public school teachers an experiment was conducted to determine the effect 
of two factors, length of form and mode of addressing, on the rate of response and type of resp It was luded 
that for this population an eight-page questionnaire with name and address of the respondent appearing on the form 
would produce a response not significantly different in rate or type from that of a six-page form identified by only a 
code number, or any other combination of these factors. There was some evidence that when forms were inadvert- 
ently sent to individuals who did not belong in the population being studied, rate of response is related in slight 
degree to these factors of form design. 


Final Grade Response to Homework Input. Franx Meissner, Stanford Research Institute. 


For several semesters, homework was mandatory in my elementary statistics course at Golden Gate College in 
San Francisco. In the Spring 1959 semester I left it up to the students whether or not they wanted to hand in their 
assignments. The association between homework inputs—measured by the percentage of ail assignments that a 
student completed—and the course output—measured by the final grade—provided an expression of “response 
elasticity of the final grade to homework input.” 
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In the letter grade range between F's and B’s the elasticity coefficient turned out to be 0.49. In other words, on 
the average, a completion of 10.0 per cent more assignments was associated with a boost of the final numerical grade 
by 4.9 per cent. The elasticity between B’s and A’s was only 0.24. This means that it takes more than just doing the 
homework to b an A student 

This tangible illustration of homework pay-off is now being used as an inducement for students to do their 
assignments. Further more, the response elasticity concept might have many other applications in measuring 
impact of certain teaching techniques. 


Sampling in Asia and the Far East. J. E. Morton, United Nations. 


The kind and importance of quantitative mass information most frequently needed in the so-called under de- 
veloped countries. The resulting need for appropriate survey methods. General vs. special purpose data. Partial 
and complete canvass in the light of non-sampling errors, of scarcity of statistical inputs, of the need for aggregative 
information, and of the p of rapid chang 

Special problems of under developed countries concerning sampling plan, field work, estimation, processing, 
and use and interpretation of data. An example: the interpenetrating design. The byproducts of sampling as a survey 
technique and its effect on statistical operations in general. 

Limitations in the use of sampling methods for application to canvasses in under developed countries. Sampling 
vs. non-sampling error and the question of census vs. sample survey. 

Direction and nature of assistance to improve the statistical inventory of under developed countries and the role 
of the sampler: the need for a more nearly quantitative outlook, the usefulness of model i , the req 
ment of long-run approach and statistical programming and systems planning, and the importance of training and 
administrative implementation. 


Some Statistical Techniques for Industrial Tolerancing. R. B. Murpuy, Bell Telephone Laboratories. 


Three methods are discussed for studying statistical characteristics of system resp to Pp t character- 
istics. Special reference is made to the problem of industrial tolerancing. These three methods are propagation of 
error, selective assembly, and a technique called multiplex sampling by its deviser, D. H. Evans. The purpose of 
using these techniques is not only to specify component combinations yielding satisfactory system response but to 
do so economically. An example of the economic approach in propagation of error is discussed. Formulas recently 
developed by J. W. Tukey give the first four cumulants of a single real response in terms of the first five moments 
of component characteristics. Methods are presented for simplifying the formulas for the cumulants by transforming 
component characteristics and system response. Selective assembly is approached with a view to maximizing the 
total fraction of usable material by means of combining component characteristics randomly from specified inter- 
vals. Multiplex sampling is presented as a Monte Carlo technique to aid in the selection of combinations of com- 
ponents to obtain desired response with least possible cost. 


On Drawing Inferences in the Case of an Exponential Distribution with Censoring. Jack Napuer, Bell Telephone 
Laboratories. 


The situation where observations are to be taken from an exponential distribution with single censoring on the 
right is considered. A sequential sampling procedure that amounts to using an inverse binomial sampling plan is 
proposed. With this procedure there exits a sufficient statistics that has a x? distribution when suitably normed. Thus, 
appropriate estimators and exact tests of significance are often easily derived. 


Measures of Product Concentration, 1947 and 1954, Developed from Census Materials. Ratpu L. Newson, National 
Bureau of Economic Research. 


The paper presents some preliminary findings of research on industrial concentration carried on last year at the 
Census Bureau. The project, part of the Census Monograph program of the Social Science Research Council, was con- 
ducted under the auspices of its Committee on Economic Census Data. Concentration measures were calculated for 
a sample of 399 product classes, for the two Manufactures Census years 1947 and 1954. The economies of electronic 
computers made these tabulations feasible. 

The present focus is a preliminary test of the hypothesis that concentration will decline as markets grow, a 
corollary of the principle that the division of labor is limited by the extent of the market. The period 1947-1954 
is rather short to identify trends, however, market growth was substantial. 

The evidence indicates a net decline in product-class concentration over this period. Interestingly, concentra- 
tion measures based on the industry concept, where all of an establishment's activity is assigned to its primary prod- 
uct, show little change. This suggests that i in dary product activities may have been important in 
reducing product concentration. Such changes were heretofore concealed, as historically comparable data on product 
concentration did not exist. 

Finally, the increase in the divergence between company and plant concentration is established and its impli- 
cations are examined. 


Some Meterological Factors in the Design of Weather Modification Experiments. FranxkLIn Newna.t, U. S. 
Department of Agriculture. 


Statistical evaluation, after the fact, of weather modification experiments, using numbers instead of attributes 
raises a question as to what particular form of the meteorological observation should be used as a basic measurement. 
A related question is the extent to which partially subjective metorelogical methods should be used to classify events 
for statistical evaluation. The use of the unit “storm” for most evaluation work by the evaluation unit of ACWC 
is explained in connection with the first question and the rejection of storm typing as a procedure deals with the 
second question. 
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Types of Family and Types of Economy. M. F. Nimxorr anp Russeit Mippieton, Florida State University. 


A rough relationship exists ketween type of family system and the subsistence pattern ordered according to 
productivity and stability. The independent family is associated with hunting and gathering cultures; the extended 
family is correlated with societies that are dominently based on fishing or agriculture. Two economic factors seem 
to be especially relevant in accounting for these differences: the size of the food supply and the amount of family 
property, both of which are generally less ample among hunters than among fishermen or planters. 

The analysis is based on (1) statistical data for 549 cultures, taken from Murdock’s “World Ethnographic 
Sample,” representative of the entire known range of cultura] variation; and (2) an examination of the literature on 
21 hunting cultures with the extended family system, constituting negative cases. 


The Federal Budget as an Indicator of Government Intentions and the Implications of Intentions. Davip Novick, 
Rand Corp. 


Our Federal Government is one of the largest and most complex businesses in the world. The largest private 
industrial firm, General Motors, had a sales volume in 1958 of $9.5 billion; the Federal budget was about $80 billion. 
The mort complete compiling and analyzing of the activities of our government takes place in the budgetary 
process, culminating in the budget document itself. The purpose of this paper is to consider the extent to which the 
Executive budget can serve economists and other analysts as an indicator of government intentions and, perhaps 
just as importantly, as an indicator of the implications of these intentions. 

After presenting several possible meanings of “intentions” and the “implications of intentions,” the question 
of how well the present budget structure reflects these considerations in economic terms is discussed in some detail. 
The conclusion is that the present budget does not serve as a very definitive indicator for the following reasons: 

(1) Lack of an “end product” or “activity” oriented structure 

(2) Orientation almost entirely to administrative or organizational requi t 

(3) Time span covered by the budget is too short 

(4) Insufficient distinction between one-time capital outlays and recurring annual operating expenses 

(5) Budget concepts and language are too complicated 
With this material as background, alternative budget structures are analyzed, and the general characteristics of a 
“preferred” structure are outlined, along with an illustrative example of a budget format embodying these sug- 
gestions. 

It is concluded that the major characteristics of a good budget structure from the standpoint of serving as an 
economic indicator of intentions and the implications of intentions are essentially the same as those required when 
the budget is viewed as a management tool to assist in making major resource allocation decisions. 


Investment Companies—Their Investments and Investors. H. S. Onera, National Association of Investment 
Companies. 


This paper attempts to present some observations on the growth of the investment company industry based on 
available composite statistical data. As investment companies serve as conduits between individual and iastitutional 
investors and their investments, examination is made of the general fields of investment on the one hand and the 
characteristics and investment objectives of the owners of investment company shares on the other. A description 
of regularly collected and available data on the industry is presented. Particular reference is made to the “service” 
features of investment company ownership that are currently being measured by a number of trend series. Based 
on a study of investment company shareholders, two types of individual investors are compared—regular account 
holders, who have made a lump sum investment but have not indicated a plan for periodic investment of new money 
and accumulation plan holders who are investing additional monies on a preiodic basis. 


The Methodology of Twin Studies. Ricnarp H. Osporne, Sloan-Kettering Institute for Cancer Research anv Oscar 
Kemprnorne, Iowa State University 


In the early work on the theory of quantitative inheritance by R. A. Fisher human data was used. In spite of 
this, and the fact that the major part of human inheritance is quantitative, attention in human genetics has focused 
almost exclusively on the so-called single gene effects. The methods of human genetics have been largely confined by 
tradition to pedigree analysis and the use of twins for estimating the relative importance of heredity and environ- 
ment. Pedigree analysis is inappropriate for the study of qnatitatively inherited characters, and estimates of herita- 
bility from twin data are not only of limited reliability, but fail to serve the human problem. 

A difficulty in the application of present genetic knowledge to human problems is that the theory of quantitative 
inheritance in populations arising by homogamy or aseortative mating has not been developed since the work of 
Fisher in 1918. Euclidation of this, as well as of other theoretical and practical problems only await sufficient de- 
ductive work together with appropriate observational data. As twin and sibling studies provide the most appropriate 
human material for the investigation of quantitative characters, some of the theoretical aspects of employing twin 
and sibling data for this purpose will be discussed. A method for partitioning the components of varance in quanti- 
tative attributes between and within twin and sibling pairs 2!so will be presented. 


An Experimental Program to Improve Construction Statistics. Carrer C. Osterninp, University of Florida. 


The need for improvements in the national and local construction statistics relexsed by the federal government 
is generally recognized. This recognition springs from certain significant discrepancies that have been found in the 
statistics, the sources of information and statistical techniques utilized, and the general lack of local coverage. 

This paper directs attention to the reasons that require more emphasis to be given to the quality of local 
construction statistics. Brief consideration is given to the ex ptual problems, the character of the need for con- 
struction statistics, the types of errors created by present estimating techniques, and alternative approaches to 
the correction of these errors. 

The remainder of the paper describes the construction statistics program at the University of Florida’s Bureau 
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of Economic and Business Research as one approach to the improvement of statisticsin the construction field. 
Among the topics covered are, the conceptual frame of reference, data sources and deficiencies. problems presented 
by the use of permit data, and methods of statistical adjustment for undercoverage and cost reporting. The roles 
of the systematic assembly and use of certain suppleiwentary data are explored. 

Attention is also given to the scope of the local work and the time, cost, and effort involved. 


A Visual Conceptualization of Linear Regression. Exiis B. Paar, Eastern Michigan University. 


The predictive role of correlation may be the most important statistical pt for behavioral students, yet 
the modal graduates of elementary courses have little grasp of it. Difficulty may stem from the algebraic language 
used in instruction, which according to recent studies is not meaningful to certain intellective structures. An aid to 
visual conceptualization of regression is presented, consisting of a backboard with a ber of colored movable 
parts. Displayed are two normal curves, the top of some predictor variable and the bottom of some predicted vari- 
able. First, instructor designates any point on top curve representing an available score usable for prediction. Next 
he selects some appropriate r and shows (by cut-outs) how predicted scores are affected by: (1) location of predictor 
scores, and (2) r between variables. As r improves, two events visually occur: (1) predicted mean moves closer to 
position of predictor score, and (2) predictive error approaches zero. Diagrams of aid may be supplied to interested 
professors. 


A Statistical Model for Diagnosing Zygosis by Ridge-Count. Donatp Ricuter, University of Minnesota anp 
Seymour Geisser, National Institutes of Health. 


Attention has been drawn recently by Smith and Penrose (Ann. Hum. Genet., 1955) and others to the possi- 
bilities of using total ridge-count as a criterion for diagnosing the zygosis of twins. In this paper, a statistical model 
for the behavior of ridge-count is formulated and studied. By utilizing available data for twins, it is shown that the 
model accounts fairly well for observed ridge-count variations; and by solving a distribution problem results are 
obtained which are useful for diagnosing zygosis not only in twins but in births of higher order. 


Unbiased Component-wise Ratio Estimation. D. 8. Rosson anp Currra Virnayasal, Cornell University. 


A rather typical experimental! situation in which the method of ratio estimation has found application is exempli- 
fied by the green weight determination of a field plot of silage corn supplemented by both a green and dry weight 
determination of a sample of plants from the plot. Plot total dry weight, the characteristics of interest to the experi- 
menter, is then estimated as the product of plot total green weight and sample dry weight expressed as a fraction 
of sample green weight. An alternative procedure in this situation is to stratify the plant material into the various 
plant components such as ears, husks, stalks and leaves before making weight determinations, and then estimating 
plot total dry weight of plant material by first estimating plot total dry weight of the separate plant components. 
If at the time of corn harvest the moisture percentage varies among these components of the corn plant then com- 
ponent-wise estimation may be expected to produce gains in precision. Applied to a collection of 40 corn plots of 
10 hills each in which ear and stover = husks +leaves+stalks green and dry weight determinations were made for 
each hill, component-wise estimation resulted in approximately a 50% gain in efficiency of estimation using either 
the biased or unbiased method of ratio estimation. 


Living Quarters and Household Concepts in the 1960 Census. J. HucH Rose anv Paut C. Guick, Bureau of the 
Census. 


The need for more complete information about living arrangements of people has led the Bureau of the Census 
to change its unit of enumeration from “dwelling unit” to “housing unit.” Consistent with the long-range goal of 
extending the coverage in the housing census, the 1960 Census will include all living quarters having some degree 
of “separateness.” The basic criteria of separateness in housing censuses since 1940 are being retained. Only the 
physical criteria for defining separat have been changed. 

Since a household is defined as the person or group of persons occupying a housing unit, the change will affect 
both the number of households and the number of housing units. It is estimated that the new definition will add 
about one million households, an equal number of occupied housing units, and some vacant units. More precise 
estimates will be available from post 1 evaluation surveys. 

The change reflects largely the sno ha of families and individuals living in partially self-contained housing 
units. The impact of wider coverage may be greatest in parts of urban areas where rooming houses and structures 
containing other light-housekeeping apartments are prevalent. Under the old procedures, the living quarters of 
some families or individuals in such structures were excluded, even though the quarters had “separateness.” 


Sampling Experiments on Ratio Estimators. ALAN Ross, University of Kentucky. 


The properties of the sampling distributions of the conventional ratio of means estimator (-X /z) are not known 
precisely for smal! samples. In this paper the distributions of ratio estimators are estimated for several small finite 
populations by computing the estimator for a number of samples. These distributions are compared with the corre- 
sponding distributions of unbiased ratio-type estimators. Comparisons of the standard error estimates are also given. 


Some Principles of Definition in Statistics. L. E. Rowesorrom anp D. H. SreinrHorson, Dominion Bureau of 

Statistics. 

The definitions examined in this paper are those used to determine the concepts and boundaries of phenomena 
observed in statistical surveys. The paper considers problems of definition confronting users and producers of eco- 
nomic statistics, and the role of the statistician in clarifying the requirements of use and survey definitions. The 
nature of definitional error is examined in relation to sampling and other sources of non-sampling error. Emphasis 
is on the probl of definition at the stages of survey initiation and tabulation rather than operation. The primary 
concern is with multiple purpose economic surveys. It examines the desirability of quantifying definitional error and 
explores some such possibilities. 
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Analysis of Multifactor Multiresponse Experiments with Mixed Factor and Response Types. S. N. Roy, University 
cf North Carolina, 


The traditional ANOVA has been mostly, though not exclusively, in terms of the planning and analysis of 
multifactor and uni-resp experi ts, some of the factors being unstructured and some structured and the 
response being a continuous (and thus necessarily structured) variate about which, for certain purposes, all that is 
assumed is that it has a mean and a variance and, for certain other purposes, we assume in addition, a normal 
distribution. An example of an unstructured factor is, say, five varieties of corn, or ten blocks, constituting five or 
ten levels of a factor, and an example of a structured factor is, say, six alternative doses of a fertilizer, constituting 
six levels of a factor. For practical purposes, even for the continuous case, a response could be pictured as observed 
frequencies in certain preassigned categories. If real numbers or values are associated with the categories, then it is 
a structured response. Otherwise, it is an unstructured response. Under any design and sampling scheme (associated 
with any multifactor experiment) what we obtain pupae isa set of experimental units, and then, according as we ob- 
serve each unit on one or several characteristics, it is a u Pp or mul ) , that is, a univariate or multi- 
variate probiem. In this paper, in the general set-up of maiitifasten and multiresponse experiments (where some factor 
types are structured and some unstructured and the same holds for the response types), questions are asked that are 
parallel to the questions in the traditional ANOVA. Depending upon the nature of the response type, the answer 
comes out in terms of an analysis that is sometimes the same as, sometimes close to and sometimes utterly different 
from the traditional ANOVA. In other words, not the usual arithmetic and algebra, but the spirit of the questions 
asked in the traditional ANOVA is considered to be more important and is used as a starting point. Extreme caution 
is also urged against indiscriminate use of transformations of variates, even before any questions or problems have 
been formulated, and it is pointed out thet such transformations and a blind use of the usual arithmetic of ANOVA 
might lead to utterly ingless cx 


On Significance Test of System Reliability. D. Rusensrein, General Electric Company. 


Let a system consist of component items selected from n diff t t types. The component items can 
be arranged in series and /or parallel. If the comp ts fail independently ond. according to the exponential probability 
law, then the reliability of the system is a function of the component failure rates; thus the reliability R=R(M, * + *, 
An) where \; is the failure rate of the i-th component type. Let the ith component type be tested for time Tj, let 
X; be the number of failures observed. For a large class of systems two tests are derived for the hypothesis that 
* * An) <Re against alternatives that R(Ai, + , An) >Ro. 

One test uses the criterion 


n 
Ss 


where s depends on the level of significance, Ro, R and 7; For this particular test an optimum choice of 7; is given. 
If the T; are arbitrarily given, 


nm 


is proposed as the test criterion where the a; are chosen in a nearly optimal fashion. The a; and s depend on the 
level of significance, o, R and 7;. The a; and s are easily derived. 


Price-Cost Behavior in Manufacturing Establishments. Ricuarp Ruaaues, Yale University. 


In order to understand the recent price and output experiences of the American economy, it is necessary to 
examine the price-cost behavior of manufacturing establishments. One of the major tools in such analysis is the 
concept of gross margin, which is defined as the difference between the total value of production and the direct costs 
of production (wages paid to direct labor and cost of materials used). 

Productivity, wage rates, and prices are basic elements which affect gross margins. Observations about these 
three elements suggest that (1) productivity gains are greatest when production is increasing; (2) wage rates exhibit 
a strong secular rise and less cyclical fluctuation than productivity; and (3) prices tend to move somewhat less than 
costs. On the basis of these observations it would be expected that gross margins would expand at the beginning of 
an upswing when productivity gains exceed wage costs and direct costs thus sag relative to prices. In the later 
phases of the upswing, the rise in wage rates in excess of the rise in productivity forces costs up and narrows the 
margin between prices and costs. This process continues into the depression, since wave rates continue to show a 
larger increase than productivity. 

Even if such an hypothesis is verified, however, cyclical variations in margins may still not be significant in the 
explanation of the general pattern of price behavior. Variations in price caused by variations in margins may be 
quantitatively small in comparison with the general effects of costs upon prices. If costs and variations in costs are 
the major determinants of prices, it will be necessary to examine further the determinants of wages and net profits 
as they differ among plants over the cycle. 


Comparing Health Statistics from Record Sources and from Household Interviews. Oswatp K. Sacen, Ruts E. 
DuN#BAM AND Watt R. Stumons, U. S. National Health Survey. 


Analysis of differences between ideal knowledge of a quantity and the product of a process for measuring that 
quantity can take many forms. Important categories for analysis are the hazards of conceptual differences, biases 
of a measurement, variances of the measurement (and sampling), process; and non-random errors of process-execu- 
tion. With a limited review of conceptual and analytic techniques for evaluating statistical data, the argument 
is offered that while substantial contributions are being made, there is no generally accepted model nor practice for 
validating a statistical measurement. Therefore many types of investigations should be undertaken and reported 
upon. The subject study compares prevalence of selected chronic conditions for a restricted population as measured 
by a response to a household interview and as indicated by service records of a group health organization in which 
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the population has membership. The hypothesis is that the two measurements definitely are not of the same con- 
ceptual phenomena, but that they treat two phenomena which have many elements in common. An effort is made 
to discover in what respects the two measures create the same image and in what respects they do not. Attempts to 
find causes for discrepancies place special emphasis upon a follow-up interview of physicians who treated the patients. 


Game Theory and Voting Behavior. Jack Sawyer AND Duncan MacRae, Jr., University of Chicago. 


In elections for the Illinois General Assembly, three members are elected from each district, and every voter 
has three votes, to distribute among the candidates in one of the following ways: 3-0-0, 2-1-0, 3 /2-3 /2-0, or 1-1-1. 
Thus a critical strategic decision for a party involves whether it should run one, two, or three candidates. It is this 
decision, made by a party committee, which is being modeled. The model applied is the two-person, zero-sum game, 
the payoff being the number elected. One of six payoff matrices obtains, depending upon the percentage of the vote 
the party receives: 0-25, 25-40, 40-50, 50-60, 60-75, or 75-100. To test the fit of the model (or the “rationality” 
of the nominating committee), data were obtained from the 1,377 district elections for the years 1902-1954. Ap- 
proximately ten per cent of the elections resulted in non-minimax outcomes. Further investigation of some of these 
elections suggests that the assumption of imizati lected is not always applicable. 


Where We Stand in Urban-Rural and Farm Residence Concepts. Henry D. SHeLpon, Bureau of the Census. 


The current definition of urban population goes back to the Census of 1910. For that census, the urban popu- 
lation was defined as the aggregate population of incorporated places of 2,500 or more. With minor modifications 
relating in large part to the treatment of New England towns, this definition was continued through the Census of 
1940. In 1950, the population of densely settled areas outside of the city limits of, but adjacent to, cities of 50,000 
or more was added to the urgan population, as was that of unincorporated places of 2,500 or more. This definition 
is to be used in the 1960 Census, again with minor modifications in the treatment of New England towns. The rural 
population has been, of course, the residual once the urban population has been defined. 

In recent years it has become apparent that the count of persons reporting themselves as living on farms in the 
Population Census exceeds by an appreciable amount the count of persons living on farms as determined by re- 
sponses to the battery of questions relating to acreage and agricultural production which appear in the Agriculture 
Census schedule. Studies of this problem in the Current Population Survey and in pretests for the 1960 Census in- 
dicated that some elaboration of the farm residence question in the Population Census tends to reduce the dis- 
crepancy; and, on the basis of these findings, appropriate modifications of this question have been made in the 1960 
Population and Housing schedule. 


An Asymptotic Expansion for the Distribution Function of Anderson’s Classification Statistic W. Rosepirx Srt- 
GREAVES, Columbia University. 


This paper is concerned with results obtained jointly with A. H. Bowker of Stanford University. Bowker has 
shown that Anderson's classification statistic W can be represented by 


U 
W=a— 
kV 


where k is a known constant, U is a function of the elements of a 2X 2 non centra) Wishart matrix and an independ- 
ent central t-variable, and V is a chi-square variable distributed independently of U. 

The usual classification procedures based on W are such that an individual is classified as belonging to a popu- 
lation n: if the value of W computed for him is greater than a cut-off point we, and to the alternative population 
nm: if W<woe. 


Critical Problems in Price Index Construction. B. R. Srauser, U. S. Department of Agriculture. 


In the nearly 200 years since Gian Rinaldo Carli first used a simple index number to measure changes in the 
purchasing power of money, index numbers have developed to a point such that now they affect the allocation of 
income flow to the extent of millions of dollars a year. 

The author reviews the history of the Wholesale Price Index, the Consumer Price Index, and Parity Index, 
pointing out the wide variation that in the past has characterized frequency of revision (intervals between revision), 
retroactiveness of revisions, reference points (base-periods), weight-base periods (period from which weights are 
derived), and commodity consist. 

He argues that with index numbers now used so widely in matters vitally affecting the economy, the time has 
come for a concerted effort to establish definite and recognized policies applicable to all major ind Such polici 
should as a minimum provide that revision of weights to placed on the same definite schedule, that the major 
indexes should utilize for their weights cross sectional surveys covering the same periods for the whole economy, 
and that they should adopt similar or at lease comparable criteria for determining commodity coverage. 


A Critical Evaluation of Stock Market Indexes. Ropert W. Storer, Manufacturers National Bank of Detroit. 


1. Different users of stock prices have differing needs. 

2. Corporate success is expressed not only in rising stock prices but also, in varying proportions, in a growing 
number of shares. 

3. This and other differences cause stock price indexes of various formulas, to reflect different aspects of the 
process and to yield different results. This fact vitiates many comparisons. 

4. The dispersion of behavior of individual stock prices causes any stock price index to be more or less un- 
representative of many issues’ action. 

5. Typical investment patterns of stockholders probably tend to aggravate for some indexes this lack of rep- 
resentativeness. 
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6. No single stock price index seems likely to meet better than all others all the needs of all classes of users 
over all time-intervals. 

7. All indexes known to the writer employ formulas which weight the readings quite heavily by either the prices 
or the aggregate market values, of a limited number of issues. This is undesirable, and nullifies to some extent the 
value of using a broad sample of issues. 

8. In order to minimize these disadvantages, it would de desirable to have an index number consisting, in part, 
of a broad-sample mean of price-relatives, avoiding either of the above explicit or implicit methods of weighting. 
Admittedly there would be involved the risk that lower-priced stocks, which have historically been more volatile 
percentagewise than high-priced issues, might then dominate the results. Also, the readings would need to be 
adjusted for increases in numbers of shares resulting from stock splits and stock dividends. This could be done by 
multiplying the reading by a factor representing the percentage i in the ber of shares since the base 
period, resulting from such actions. 


Health, Education, and Welfare Indicators and Trends. Lurner W. Srrtnanam, Department of Health, Education, 
and Welfare. 


Much effort has been devoted to the Department of Health, Education, and Welfare to bring together statistical 
information on a wide variety of subjects that relate to the human resources of the nation and to presenting this 
information in a form that will be useful to those who are concerned with the health, education, and welfare of 
the people. 

The primary objective of this effort has been to assemble, organize, interpret, and present statistics for policy 
use. In this connection there has been intensive examination of data of many classificati ic, demographic, 
cultural, ete.—to find those which are the best indicators of social progress and those which may be used to identify 
current and emerging problems. 

A part of the effort to make statistical data conveniently available for executive use has involved the develop- 
ment of the monthly Health, Education, and Welfare Indicators and its annual supplement, the Health, Education, 
and Welfare Trends. The purpose of this paper is to report on the develop t of these d ts and to describe 
some of the problems involved in bridging the gap between the production of statistics and their use by policy officials. 


Final Tract Tables to be Published for the 1960 Census. Conrap Taruser, Bureau of the Census. 


Results of the 1960 Census of Population and Housing are to be published by Census Tracts. The reports will 
include information collected on a 100 per cent basis, relating to all persons and housing units, and that collected 
on a sample basis. Tracts are small stable areas within cities and Standard Metropolitan Statistical Areas which 
have been developed in cooperation with local groups to provide a basis on which census and other statistics may 
be presented in a way that facilitates comparisons from one census to the next. Tracts have been recognized for 
180 areas, including all but 23 of the cities which are known to have a population of 50,000 and over. For 129 
Standard Metropolitan Statistical areas the entire area has been tracted. Tract statistics from the 1960 Census will 
be released in a series of bulletins, one for each area. The first release is expected about July, 1961; the entire series 
should be completed by December, 1961. Some of the information which is being collected on a 100 per cent basis 
is expected to be available earlier. This information may be made availeble prior to publication if needed to meet 
urgent local needs. 


Western Expansion and Ethnic Convergences in the Population of the United States. Irene B. Taruser, Princeton 
University. 


The Western expansion of the population of the United States provides a laboratory for the study of the inter- 
relations among social, economic, and demographic factors. The analysis of population changes among the ethnic 
and cultural groups is significant theoretically, and it is relevant to questions of future population trends in Asian 
countries. Over the last century, in the area that extends westward from Texas to the Philippines and southward 
from Alaska to the Micronesian islands, the movements toward lower fertility and smaller families have been as- 
sociated with such measurable variables as educational advance, upward occupational mobility, higher income, 
and urban or metropolitan residence. The major demographic differences among ethnic and cultural groups have 
been associated with the differences in the social and economic characteristics, the migrant status, and the areas of 
residence of the various groups rather than with factors that were specifically ethnic or cultural. 


Empirical Analysis of Segregation Indexes. Kanu E. Taruser, National Cancer Institute. 


Several indexes have been proposed to measure white-nonwhite residential segregation. Each index may be 
computed from data for any set of areal units. Using data for 60 cities for 1940, only a rough correspondence was 
found between values of a given index computed from block data and computed from tract data. Intercorrelations 
of eight indexes, all computed from 1950 block data for 188 cities, reveal sizeable empirical differences between the 
indexes. Values of segregation indexes for U. S. cities thus may differ because the areal units from which they are 
computed differ, or because of differences in the way the areal data are utilized. For intercity comparisons, the con- 
ceptually best index is the Gini index of concentration computed from block data. This index can be approximated 
closely with the more easily computed index of dissimilarity based on block data. 


Concepts and Methodology of Savings: A Study on Mexico. Franx M. Tamaana, The American University. 


A study of savings in Mexico was undertaken in 1958, under the auspices of the Center for Latin American 
Monetary. Studies, with three objectives in mind: 

1. To review concepts of savings and forces infl g the formation and uses of savings, with emphasis on 
their application to countries in process of ic de t: 

2. To study methodological problems related to colnaiien of data and their presentation in tabular form, to 
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reflect interrelations between sectors of the economy, the intermediary role of financial institutions, and sources 
and forms of savings and physical investment; 

3. To develop possible uses of savings data in current economic analysis, with particular reference to their 

pplication to fi ial and monetary policies, in combination with other statistical presentations and social ac- 
counts (national income, flow of funds). 

The study had an experimental and pilot character, in that the country and the year were selected with a view 
to study the possibility of extending its application to other countries and developing continuing savings series. 
An important consideration was therefore flexibility in approach, so that its processes of methodology and analysis 
could be applicable to changing institutional and economic conditions. 


Multivariate Correlation Models with Mixed Discrete and Continuous Variables. R. F. Tatz, Cincinnati Bell 
Telephone Company, AND INGRAM OLKIN, Michigan State University. 


Correlation Is are idered in the following setting: X =(X1, X2, - - + , Xx) isa Bernouilli random vector 
(k=2, 3,+ ++); ¥=(¥1, ¥2,° ++, Yp) is a continuous random vector (p=1, 2, ) whose conditional distri- 
bution given X =z is multivariate normal with unknown mean vector depending on z and unknown covariance 
matrix independent of z. Small sample distributions, limiting distributions, relati between par ters, and ap- 
plications are discussed for various multiple and partial correlation coefficients. Special emphasis is placed on the 
particular cases k =2, p>1l and k>2, p=1. 


The Minimum Requirements Approach to the Urban Economic Base. Epwarp L. ULLMAN AND Micwaet Dacer, 
University of Washington. 


1. Five groups of U. 8. cities in size classes ranging from 2,500 to 800,000 were picked and in each size class, 
38 cities, the total universe in the largest class, were chosen at random. 

2. For every city the percentage of total labor force employed in each of a 14 industry census classification of 
employment was calculated. 

3. In each of the five size groups the lowest percentage employed among the 38 cities in each of the 14 employ- 
ment categories was picked and these minima were d to give an indication of total minimum requirement, 
or @ consistent approximation of the total internal or nonbasic employment of a city. 

The sums of the minima by city size classes are as follows: 2,500-3,000 =24%; 10,000-12,500 =33,.2%; 25— 
40,000 =39.8%; 100-150,000 = 43.1%; 300-800,000 =48.6%. Regression lines were also calculated. The larger the 
city the larger 1s the percentage, a finding consistent with theory, since the larger the city the more self-contained 
it should be. 

Other tests and interpretations of the data yielded consistent comparisons, valuable in understanding cities, 
even if the precise application to economic base ratios is not idered. The pt is similar to that employed 
by Alexanderason and Morissett, although conceived independently and calculated differently. 


Some Statistical Problems in an Organization Research Project. Harrison Wuire, University of Chicago. 


In carrying out a case study of the position of R & D managers in the top management of a manufacturing 
corporation problems in statistical inference which appear to be somewhat novel emerged in four types of data: 
sociometric choices, vocabulary attributions, legitimate information scores, and perceptions of attention allocation. 
Several of these prob] will be defined. Attention will be primarily directed to an analysis of the stochastic nature 
of the psychological] process by which a manager estimates the division of some R & D manager's attention among 
categories of activity suggested by the observer. A crude model of this process will be proposed in terms of the allo- 
cation of percentage units of attention among Z micro-categories of attention which it is suggested are the units 
in the estimator’s own thought processes. The number of micro-categories assigned to each category imposed by 
the observer's question is assumed to reflect a hypothetical long-term estimate by the respondent manager. The 
inadequacy approaches to the correction of these errors. 

The remainder of the paper describes the construction statistics program at the University of Florida’s Bureau 
of E ic and Busi Research as one approach to the improvement of statistics in the construction field. 
Among the topics covered are, the conceptual frame of reference, data sources and deficiencies, problems presented 
by the use of permit data, and methods of statistical adjustment for undercoverage and cost reporting. The roles 
of the systematic assembly and use of certain supplementary data are explored. 

Attention is also given to the scope of the local work and the time, cost, and effort involved. 


A Representation of the Wishart Matrix. Ronert A. Wusman, University of Illinois. 


Random orthogonal transformations (see Adn. Math. Stat. Vol. 28 (1957) p. 415) can be used to great advantage 
to solve certain distribution problems in multivariate analysis. The method leads to the following representation of 
the Wishart matrix A in the case that the population covariance matrix 2 is the identity: A=7T’, where T isa 
triangular matrix with z-variables on the diagonal and N(0, 1) variables off the diagonal. all variables being inde- 
pendent. If = is not the identity, then A =CTT’C’, in which CC’ =~ (Ann. Math. Stat. Vol. 30 (1959), p. 597). 
Some of the applications of this representation of A are: deriving the distribution of A; generating values of A by 
& random process; obtaining representations for Hotelling’s 7, Wilks’ generalized variance, the sample correlation 
coefficient, the sample multiple correlation coefficient (all these also in the non-null case), and the sphericity criterion 
for the bivanate distribution in the null case. The sample correlation coefficient r has the following representation: 


r Tu + Tup/ V1 — p* 
vl—r Tn 
in which p is the population correlation coefficient and the 7';; are the elements of a lower triangular 2X2 matrix T 
of the kind described before. The right hand side of this expression can be described as a non-central ¢-variable 
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(apart from a constant) with a random non-centrality parameter which is proportional to a X-variable. The mono- 
tonocity of the probability ratio follows then almost at once. Similar remarks hold in the case of the sample multiple 
correlation coefficient R, where R?/(1—R*) is obtained as tially a non-central F-variable with random non- 
centrality parameter proportional to a X*-variable. For the sphericity criterion Z (T. W. Anderson, “An introduc- 
tion to multivariate statistical analysis,” Wiley, 1958) in the bivariate case we obtain a representation of Z/(1—Z) 
essentially as an F-variable. 


Unbiased Estimation. W. H. Wimu1ams, McMaster University. 


In sampling theory one attempts to develop estimators of population parameters, such as the mean yy of a 
variate y, which are unbiased and have good precision. Ratio and regression estimators have beeen constructed to 
give better efficiency by utilization of a correlated concomitant variate. The ratio estimator is ¥ =Jux/z 
where § and # are sample means and ux is the unknown population mean of the auxiliary variate z. The estimator 7 
is biased but does possess desirable variance properties. The usual regression estimator assumes a linear model and 
an estimator, %, of uy is obtained by evaluation of the line of least squares fit at ux. Thus 75 = 9 +b(ux —Z) where b 
is the least aq slope estimator. This estimator is unbi if the linear model assumption is valid but is biased 
otherwise. 

A procedure is given which will generate exactly unbiased ratio and regressi stimators. The form selected 
for an arbitrary coefficient determines whether the generated estimators fall into the class of ratio or regression 
estimators. For example, if this coefficient is of the form of a least squares slope estimator, then the unbiased esti- 
mators are very similar to §. Another form results in the known Hartley-Ross unbiased ratio estimator. 

Application of this procedure to separate strata is immediate and the argument is extended to form a “combined” 
estimator for stratified and multistage sampling. 


Some Aspects of Unemployment in the United States. Seymour L. Woirsetn, Department of Labor. 


The past several years have witnessed a substantial increase in the analyses of the component parts of the un- 
employed (their demographic and economic characteristics) and unemployment (frictional, structural, seasonal, 
etc.). This disaggregative approach, i.e. , examining the anatomy as well as the global total of unemployment has 
been accelerated by the growing availability of data on the subject and the realization that they help toward a 
more ingful perception of the significance of unemployment and an assessment of suggested programs to meet 
the problems of unemployment. 

This paper exarnines one aspect of the disaggregative approach, by studying the monthly data for one economic 
cycle (1957-58-59) on the gross changes in unemployment. It asks and attempts to provide at least tentative answers 
or hypotheses on four questions: 

1. How many separate individuals are actually involved in generating a given month’s unemployment? 

2. When all the turmoil sifts down, what per cent of a given month’s unemployment is left over from the pre- 
vious month? 

3. Who becomes newly unemployed each month? How many come from employment? How many from outside 
the labor force? 

4. What happens to those who leave unemployment each month? Do they become employed? Do they leave 
the labor force? 

The available data are also examined for seasonal and cyclical patterns and differences by sex are also noted. 
Further avenues for research are indicated. 


The Use of Rotating Samples in the Census Bureau’s Monthly Sample Surveys. Ratpn S. Wooprvurr, Bureau of 
the Census. 


Rotating samples are used in a number of the Census Bureau's monthly surveys because of one or more of the 
following advantages: (1) rotation spreads the burden of reporting among more respondents; (2) rotation permits 
the use of data from past samples to improve the current estimate through the use of the composite estimation 
procedure; (3) large observations, which may be the principal source of variance, may be identified in each rotating 
panel, and placed on a special panel at a reduced weight. 

The first of these three advantages, is not considered further. The estimation procedures, variances and opti- 
mization of the composite estimation procedure and large observation procedure are examined. 

The monthly retail trade survey of the Bureau of the Census is used as an example of the use of rotating 
samples. The use of a rotating area sample in this survey has produced large reductions in the variance of level 
estimates for a single month (as compared with similar estimates from a non-rotating sample of the same size). 
The variance of sums of monthly estimates (such as quarterly or annual totals) has been reduced to an even greater 
extent but the variance of the ratio between two consecutive monthly estimates has been slightly increased. 


A Comprehensive Training Program in Industrial Statistics. A. W. WorrHam, Texas Instruments, Inc. 


Since very few universities actually have a comprehensive training program for industrial statistics, and since 
the subject of industrial statistics is very close to a necessity in present day business, it is necessary that the training 
department of industry consider seriously the evolution of such a comprehensive training program. 

This paper will present the industrial evolution for the comprehensive program that is planned and is in progress 
at Texas Instruments. It is naturally suited to the particular business with which Texas Instruments is involved. 
Therefore, the program may not necessarily have broad applicability to other industrial activities. It does, never- 
theless, show that an evolutionary process can take place which will lead to a thorough and comprehensive program. 
The span of this program covers some three years in its evolution. It began with the most basic courses and finally 
in its full developed form includes many specia! short courses that supplement and complement the program to 
make it complete. 
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Determinants of Consumer Demand for House Furnishings and Equipment. Vernon G. 
Lippitt. Cambridge: Harvard Economic Studies, vol. CX. Harvard University Press, 
1959. Pp. xiv, 172. $6.00. 


Ricuarp Strong, Cambridge University 


HIs is an excellent book with which the author won the David A. Wells Prize for 

1956-57. It is the kind of detailed study of a single commodity group which is 
particularly useful at this stage in the development of demand analysis and one, 
moreover, which contains methodological suggestions of general interest. In it the 
author campaigns for the general acceptance of two important procedures in de- 
mand analysis: the combination of cross-section and time-series information and 
the extensive use of multivariate analysis of variance methods in analysing cross- 
section data. These propositions should command general assent and the author 
illustrates their power in his empirical work. The first is important because there are 
many determinants of demand, some of which can best be examined by looking at 
household budgets, others by looking at developments over time. The second is 
important because it enables a number of effects to be isolated (the author examines 
the effect of income , family composition and age, occupation, city size, and region) 
without specifying the form of functional dependence and in such a way that the 
basic question of the independence of these effects can be checked in an efficient 
manner. In the cases he examines the author finds that the interaction of effects is 
relatively small. This is of interest because it means that in forecasting it is only 
necessary to specify future marginal totals and not the complete set of future cell 
frequencies in a multivariate distribution. In this connection it may be worthwhile to 
mention that similar methods are useful in dealing with another problem in demand 
analysis, namely, the treatment of seasonal effects in the analysis of monthly and 
quarterly data. I have tried to set out the basic ideas in my O.E.E.C. report on price 
and quantity index-numbers and have found the method very convenient in practice. 
It has been further developed here by J. A. C. Brown for the purpose of extracting 
symmetric substitution matrices from short-period data on prices and quantities 
alone. It would appear that these methods are not well understood by economists 
since they are frequently criticised for their rigidity when, in comparison with other 
methods, they are in fact extremely flexible. 

The analysis of cross-section data, which occupies a long chapter II, is based 
primarily on the study of consumer purchases made in 1935-36. It enables a number 
of effects to be isolated and leads to the interesting conclusion that these can be re- 
garded as substantially independent and therefore additive. 

After various experiments, the author decides to use the per cent of income spent 
as the dependent variable and in chapter III he forms a time series of what he calls 
the combined distributional effects, that is the effects of income, family composition, 
occupation and region, by assuming that their influences on the percentage of income 
spent remain constant over the years. On this basis the values found for 1935-36 are 
applied to data for other bench-mark years, specifically 1929, 1941, 1947, 1950 and 
1953. 

The time series analysis is based on data for 1929-41 and 1946-53. Consumers’ 
expenditure on furnishings and equipment is the dependent variable and the results 
can be summarised as follows: (1) real disposable income is an important factor; (2) 
the correction of the income effect for changes in the combined distributional effect 
makes little net difference; (3) some importance attaches to changes in the number of 
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newly married couples, a category which, together with households which had moved 
in the last year, was excluded from the 1935-36 survey; (4) the introduction of prices 
somewhat improves matters; (5) the introduction of the change in disposable income 
makes very little difference; (6) replacement needs are rejected as an independent var- 
iable because their influence appears to be negative. On the whole the residual vari- 
ances in the different regression equations are small even when calculated for the 
whole period while the coefficients are estimated from the prewar period alone. 

There is one point at which I believe the author’s interpretation of his results is in- 
correct and the following argument serves only to support his empirical findings 
against his own rejection of them. The problem occurs on p. 122 (equation 9) where 
Lippitt introduces a measure of replacement needs as a possible explanatory variable 
in his time series analysis. The introduction of replacement needs substantially re- 
duces the residual variance but Lippitt rejects it because it has a negative sign. But 
this is not necessarily incorrect. Replacement needs are obtained by applying a retire- 
ment curve to each year’s purchases and summing the retirements so obtained for each 
year. This series varies with the stock and, given a constant retirement curve and a 
stock which, while still growing, had achieved a stable age-composition, would be pro- 
portional to that stock. If the series is interpreted as a series for stocks, then its effect 
on purchases may well be negative. In the depreciation model which I published with 
D. A. Rowe (Econometrica, July 1957) the sign depends, in a linear model, on the 
ratio of the rate of consumers’ adjustment to equilibrium to the depreciation rate. 
If this product is greater than one, then the opening stock will exercise a negative 
effect on the purchases of the subsequent year. In our recent work with linear models, 
the product turns out to be greater than one for furniture and furnishings and for 
the other durable groups, radio and electrical goods and cars as well. More recently 
I have developed a general model which takes replacement as well as depreciation 
into account and, while I have not yet made any calculations with it, I expect to find 
a similar effect when it is applied to actual observations. The practical result of this 
empirical finding is that the immediate effect of a rise in income on purchases is larger, 
and not smaller, than the ultimate effect which is reached as equilibrium is ap- 
proached. The reason is that in the case envisaged current needs for maintenance plus 
the new investment brought about by the rise in income are together greater than the 
ultimate needs for maintenance (there are none for new investment) when equilibrium 
is reached. 

I see the point of the author admitting to his six-year-old daughter that his graphs 
are only designs. But I can assure him that to me they are very real and I am sure 
that most of his readers will agree with me. 


Stichproben aus endlichen Gesamtheiten. (Sampling from finite populations.) Frank- 
furter Wissenschaftliche Beitrige: Band 17. Giinther Menges. Frankfurt am Main: 
V. Klostermann, 1959. Pp. 179. 8.50 DM. 


Herman Wo pn, University Institute of Statistics, Uppsala, Sweden 


fy Germany towards the end of the 19th century the social sciences were the frame 
for significant developments in theoretical and applied statistics. Internationally 
wellknown names like Lexis, Becker, Knapp, Laspeyres and Paasche formed the nu- 
cleus of a German school of socioeconomic statistics. It is one of the intriguing ques- 
tions in the history of statistics how it was that this flourishing period came to an 
abrupt end. The question often comes to mind when reading modern German sta- 
tistical literature, and in particular so with G. Menges’ doctorial dissertation. Three 
features may be emphasized. First, the sharp antitheoretical attitude that dominated 
official statistics from the turn of the century, alienating German statisticians from 
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the epochmaking developments in England and the States. Menges’ book is an en- 
deavour to catch up with modern sampling theory. An expository survey is given, 
with some background material from probability and decision theory. Numerous 
references (p. 60) recall that sampling practices were in widespread use in Germany 
in the period 1900-1915. Second, a point closely related to the first, statistical teaching 
in Germany is by long tradition a cathedra affair, the students being taught'statistical 
theory and applications from lectures and by reading, not by laboratory training in 
the handling of statistical figures and procedures. In line herewith, Menges’ book 
contains almost no figures or statistical tables, and several of his comments have the 
flavour of Lesefriichte, fruits of reading. 

The third point is more positive. In its flourishing period around 1900, German 
statistics was integrated with important developments in the social sciences in 
general, Max Weber’s sociological works being a typical illustration, and this gave 
German statistics a broad outlook and linked it up with central problems in the 
social sciences in a more articulate way than in the other countries I have referred 
to. Notably, problems of explanatory analysis on the basis of nonexperimental data 
is an area that has always been strongly emphasized in German statistical literature, 
a tendency that curiously enough goes hand in hand with the antitheoretical attitude 
of the official statistics. It is an attractive feature of Menges’ book that he keeps up 
this tradition, and it is also typical and attractive that he emphasizes the aspects 
of explanatory analysis that belong to the general theory of knowledge. It is quite 
another thing that the area at issue is underdeveloped, both in Germany and else- 
where, if compared with explanatory analysis on the basis of experimental data.! In 
recent years, however, the interest in this same area has been markedly on the in- 
crease. This last development has escaped the attention of Menges. Taking for exam- 
ple his argument that inference from finite samples vs. infinite populations gives rise 
to an epistemological distinction between immanent vs. transcendent inference, his 
point would be more in line with modern theory of knowledge if formulated in terms 
of model construction. For another thing, his distinction between procedures of 
statistical inference in natural vs. social sciences would be more to the point if framed 
as a distinction of experimental vs. nonexperimental data as a basis for explanatory 
inference. 


Finite Mathematical Structures. John G. Kemeny, Hazleton Mirkil, J. Laurie Snell, and 
Gerald L. Thompson. Englewood Cliffs, N. J.: Prentice-Hall Inc., 1959. Pp. 487, $8.25. 


Patrick University of Chicago 


HE purpose of Kemeny, Snell, and Thompson’s recent Introduction to Finite 
Mathematics was to introduce biology and social science students to some of the 
ideas of modern mathematics. The volume under review is essentially that book, re- 
vised to meet the needs of the engineering and physical sciences. Whereas the former 
book presupposed only high-school mathematics, calculus is required in some parts of 
the present one. 
Following chapters on the propositional calculus and elementary set theory, comes 
a treatment of probability theory in discrete spaces. The topics covered include con- 
ditional probability, random variables (a term never used), finite-state-space sto- 
chastic processes, means, variances, Chebyshev’s inequality, the weak law of large 
numbers, the binomial and Poisson distributions, and a statement of the central limit 
theorem for Bernoulli trials. A brief final section deals with a specific problem of 
statistical inference (essentially the tea-tasting experiment). 


1 For a recent review see H. Wold, Journal of the Royal Statistical Society, Ser. A, Vol. 119, p. 28 (1956). 


378 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


The next two chapters are on elementary linear algebra and convex sets. The linear 
algebra treated includes vectors and matrices, abstract vector spaces, linear trans- 
formations, bases and dimension, with applications to linear and differential equations 
and geometry. The material covered is well done, but since no spectral theory is 
included, it is really not sufficient for students of statistics. The chapter on convex 
sets includes a treatment of linear programming but, unfortunately, none of game 
theory. 

There follows a chapter on finite Markov chains covering the basic theorems for 
ergodic chains and absorbing chains. Although the theory is neatly handled, there are 
some lapses of rigor. For example, it is stated that in an absorbing chain there is 
probability one of eventual absorption, while actually a result of the “weak” type 
is proved. This is unfortunate since in general the authors have been careful to call 
attention to places where rigor is impossible without more advanced concepts and 
methods. 

The final chapter introduces some of the notions of measure theory and of proba- 
bilities in a continuum. Distribution functions, densities and moments of a single 
random variable are discussed, as well as joint distributions and sums of independent 
variables. 

By and large the book is well written and should prove a useful introduction to the 
topics covered. Of course it is no substitute for individual courses on these topics; 
in particular it does not supply all the mathematics needed for an intermediate sta- 
tistics course. 


Economic Forecasts and Policy. H. Theil. Amsterdam: North-Holland Publishing Com- 
pany, 1958. Pp. xxxi, 555. $13.35. 


V. Lewis Bassi, University of Illinois 


HE primary focus of the book is the application of the techniques of mathematical 
Waseiunbes and statistics to the problem of forecasting. The analysis—devoted 
primarily to evaluating forecasts and forecasting procedures in mathematical terms 
—reaches its culmination in chaps. VII and VIII, which deal with the role of fore- 
casting in the dynamic context of decision-making under uncertainty. The problem 
of the decision-maker is considered in five aspects: incomplete knowledge of the cur- 
rent situation; partial control of the outcome through measures for changing some of 
the variables; means for making predictions of the uncontrolled variables; a prefer- 
ence scale for evaluating alternative outcomes; and choosing the set of measures that 
will produce the “best” result. » 

As a rule, the problems involved are presented in a form that makes them soluble 
by maximizing a quadratic function subject to linear constraints. The solution then 
becomes “a straightforward generalization of the maximization of a consumer’s 
utility function subject to a budget restriction.” The advantages of this approach 
derives from its use of the loss of welfare—defined as the difference between the level 
of utility attained and the maximum attainable—as a “natural measure of the serious- 
ness of the forecasting error.”’ Techniques relating to all aspects of the decision- 
maker’s problem are appraised from this standpoint. The analysis has the same rele- 
vance to practical forecasting as utility-indifference functions have to other economic 
decisions. 

Testing the predictive auality of actual forecasting procedures is a major objective. 
The actual forecasts appraised are those obtained from the macro-economic models 
used by Dutch and Scandinavian government agencies and the German industry fore- 
casts obtained from the surveys known as the Munich Business Tests. The over-all 
conclusion of this appraisal is that the forecasts are “neither thoroughly bad, nor 
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unsurpassable.” A large body of empirical evidence supports the conclusion that 
“the most prominent forecasting error is the underestimation of changes.” Various 
factors accounting for this type of error are considered; important among them are 
tendencies to report no change or to assume no change in certain determining factors. 
Other imperfections of forecasts are also given critical attention. 

Of most direct interest to the statistician is the appraisal of methods for making 
and testing forecasts. The problems of statistical inference are treated in Chapter VI, 
which, for all the complexities of mathematical treatment, rests upon a point of view 
that is essentially practical. To illustrate, it is pointed out that “multicollinearity is 
one of the worst aspects of estimation”; that for forecasting purposes, it “is not very 
important as long as the correlation is maintained in the prediction period”; but that 
the policy-maker may nevertheless need to know the separate influence of controlled 
and uncontrolled variables. 

In the debate over the methodology of estimation, the author leans heavily toward 
least squares. He formulates “a class of methods (the k-class), which contains least- 
squares, limited-information and a third method, two-stage least-squares, as special 
cases. The third method is asymptotically equivalent with limited-information, but 
it is considerably simpler.”’ This simpler method, which loses none of the advantages 
of the more complex solution, is a major contribution. It is pointed out that in prac- 
tice the least squares method, combined with the already-noted tendency toward 
underestimation of changes leads to a forecasting situation in which “two errors tend 
to compensate each other.”’ Hence, “... least squares is an estimation method 
which, though not perfect, cannot easily be surpassed; and the same is true for two- 
stage least-squares in the estimation of simultaneous equations.” 

Again, “ ... the method of least-squares can no longer claim to have the brilliant 
properties which earlier econometricians thought it had. But, as far as experimental 
estimation in small samples is concerned, the method may be regarded as one of the 
few one-eyed men who is eligible for king in the country of the blind. Since the validity 
and the sharpness criteria do not give a clear answer to the question which method 
is to be chosen, the simplicity criterion should convince us to use least-squares. It is 
clear, however, that the classical standard errors tend to overestimate the accuracy of 
its points estimates. ‘Not significant,’ is, therefore, an expression that may be pro- 
nounced with louder voice than ‘significant’.” 

Lest these lucid quotations mislead the unwary, the reader is warned that the 
book does not provide much easy reading. Basically, it is written for professionals in 
econometrics, and large parts of it will be incomprehensible to anyone not versed in 
the mathematics of matrix relations. Nevertheless, there are certain values the prac- 
tical forecaster can realize (not without study but without understanding the mathe- 
matics fully). Among them are a knowledge of rigorous methods for testing forecasts, 
a summary of forecasting experience, especially in Europe, and numerous insights into 
forecasting processes and relationships in a setting of the kind of problems forecasts 
are intended to solve. 


Thirty Years of Research in Human Fertility: Retrospect and Prospect. Papers Presented 
at 1958 Annual Conference of the Milbank Memorial Fund, Part IT. New York: Milbank 
Memorial Fund, 1959. Pp. 157. $1.00. 


Eve yn M. Kitacawa, University of Chicago 


T° MEMORIALIZE the 30th anniversary of the Milbank Memorial Fund’s Division 
of Research and its work in population, a round table under the title “Thirty 
Years of Research m Human Fertility” was organized for the Fund’s 1958 Annual 
Conference. Actually, however, five of the nine papers summarize substantive results 
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of two important current studies of fertility: the “Growth of American Families 
Study,” undertaken jointly by the Scripps Foundation for the Study of Population 
Problems and the Survey Research Center of the University of Michigan, and based 
on interviews in 1955 with a national probability sample of white women; and the 
“Study of the Future Fertility of Two-Child Families,” a longitudinal study being 
conducted by the Office of Population Research, Princeton University, of which the 
first interviews taken in September 1956 provided the basic bata for three papers in 
the symposium. 

The four papers concerned with either retrospect or prospect are Hajnal’s “Study 
of Fertility and Reproduction: A Survey of Thirty Years,” Ryder’s “Appraisal of 
Fertility Trends in the United States,” Grabill and Schachter’s discussion of “Needs 
and Prospects for New Census and Registration Data on Fertility,” and Nelson’s 
evaluation of the “The Present Status of Research in the Biological Control of 
Fertility.” 

Hajnal’s contribution is an illuminating discussion of the development of the theory 
and measurement of fertility and reproduction in the past thirty years, surveying 
the dominance in fertility analysis of the “use of reproduction rates, supported by 
the stable population concepts” during the first two decades, and the post World 
War II criticism of the net reproduction rate and the accompanying shift to cohort 
analysis. He makes explicit the usefulness of each approach; describes the difficulties 
of assessing the long-run implications for population growth of “current reproductive 
behavior,” a nebulous concept anyway in view of the marked fluctuation of fertility 
with economic conditions; and in general clarifies the issues involved in the admittedly 
difficult task of measuring reproduction. 

Ryder’s appraisal of fertility trends brings a breath of fresh air to the dilemma of 
choosing “between distorted period indices [fertility rates for a given date] and those 
for incomplete or non-contemporary cohorts.” With a three-dimensional model in- 
cluding age of women, time, and fertility, he derives equations relating cohort fer- 
tility rates and period fertility rates, using data from Whelpton’s monograph on co- 
hort fertility; and then makes projections of completed fertility for currently fertile 
cohorts. Several interesting conclusions result. For example, the impossibly high level 
of first order total fertility rates for recent years (more than 1,000 first births per 
1,000 women) is due primarily to the recent marked decline in the mean age of mother 
for first births, which caused an increase in the overlap of successive cohorts’ fertili- 
ties. He also finds that “by progressive elimination, first of large families and then 
of very small families, the American pattern of procreation has become more and more 
homogenized,” and concludes that in the future the time pattern of fertility may be 
the most changeable demographic measure. This interpretation of recent trends ac- 
counts, in part at least, for the increased attention given to the study of patterns of 
child-spacing, or the timing of births, in current research. 

This research emphasis is illustrated in the papers by Potter and Sagi. Potter 
shows that the intended length of pregnancy postponement can have an important 
effect on measures of contraceptive effectiveness. Sagi separates birth intervals into 
two components—an intended (voluntary) period of infertility, and a residual—and 
finds that correlations with attitudinal variables are higher for the intended com- 
ponent. 

Freedman’s paper on “Fecundity and Family Planning” analyzes the 1955 nation- 
wide data on subfecundity and the effectiveness of contraceptive practice together 
because awareness of subfecundity is related to the use of contraception. While dif- 
ferences in reproductive capacity (subfecundity) could not account for fertility dif- 
ferences between major social groupings, they appeared to be a partial explanation 
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of the lower fertility of working wives. He concludes that by and large in relation 
to the effects on fertility, the number of unwanted children born appears almost to 
balance the number the Subfecund want but can’t have. 

In his analysis of the 1955 data on socio-economic differentials in fertility, Campbell 
concludes that the “expectations of the wives interviewed suggest that there is a 
strong tendency for differentials to narrow.” Westoff reports on “Religion and 
Fertility in Metropolitan America,” using data for the Princeton study, and finds that 
religious preference exerts a significant influence on fertility, not only in terms of 
differences in fertility-planning success and birth intervals, but also on the number of 
children desired; Catholics, Protestants and Jews rank in that order from high to low 
fertility patterns. Campbell, however, was inclined to discount the higher expectations 
of Catholic women reported in his data, and thought that they would not have as 
many children as they indicated they wanted. 

The high quality of all of the contributions to the symposium makes this a valuable 

k. 


Statistics of State School Systems: 1955-56, Organization, Staff, Pupils, and Finances. 
Biennial Survey of Education in the United States, 1954-56. U.S. Department of Health, 
Education, and Welfare, Office of Education. Washington, D. C.: U. 8. Government Printing 
Office, 1959. Pp. xi, 140. $.45. 

Current Expenditures Per Pupil in Public School Systems: Small and Medium-Sized 
Cities, 1956-57. Circular No. 538. U.S. Department of Health, Education, and Welfare, 
Office of Education. Washington, D. C.: U. 8. Government Printing Office, 1958. Pp. iii, 40. 
$.25. 

Participation in Adult Education. Circular No. 539. U. S. Department of Health, Education, 
and Welfare, Office of Education. Washington, D. C.: U. 8. Government Printing Office, 
1959. Pp. x, 37. $.30. 

Statistics of Land-Grant Colleges and Universities, Year Ended June 30, 1957. Circular 
No. 541. U. S. Department of Health, Education, and Welfare, Office of Education. Wash- 
ington, D. C.: U. 8S. Government Printing Office, 1959. Pp. vii, 101. $.55. 


Francis G. Cornet, Engelhardt, Engelhardt, Leggett and Cornell 


pores four statistical reports of the U.S. Office of Education represent four distinct 
categories of statistical service of that agency. The state school systems volume is 
archival. It is the final accounting for the school year 1955-56 of numbers of pupils, 
staff, and receipts and expenditures in public elementary and secondary schools in 
the states and territories of the United States. Its function is largely historical. The 
publication lag of three years limits its usefulness in administrative affairs of the 
various state systems. Its pattern was established early in the history of the Office of 
Education, 1869-70, so that it is a part of an historical series. 

The second publication on current expenditures per pupil is one of a series of reports 
published periodically by the Office intended primarily for administrators of the 
school systems desiring comparative expenditure data. The Office has been able to 
shorten the time lag in this series. A later release in the series covering the school 
year 1957-58 was published in the fall of 1959. 

The adult education survey brings the Office program into new dimensions, both in 
substance and in method. It is not a recurring study, it reaches a widely dispersed, 
rarely studied area of school attendance which is difficult to enumerate. It accom- 
plishes this through an efficient modern statistical method. 

The fourth is much like the State School Systems report in that it is one of a re- 
curring series which has continued for 88 years. The data are a complete accounting 
of land-grant institutions as to students, staff, and finance. The value is primarily 
documentary and historical. It differs, however, in that it is a special report of a 
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special group of higher educational institutions with which the Federal Government 
has some reason for expecting accountability since it contributes funds to the Land- 
Grant institutions. It presents much more comprehensive statistics than are available 
for other higher educational institutions. 

The four reports have one thing in common. They relate to the rudimentary core— 
numbers of students, staff, and/or dollars—-of information about some of the various 
school populations with which the Office must deal. 

The State School Systems, Current Expenditures, and Land-Grant volumes are a 
part of the basic reporting program of the Office. As general purpose summaries they 
represent comprehensive reporting. The State School Systems document, for example, 
contains adequate text, along with basic tables, and such analytical material as an 
historical summary for the continental United States from 1869, a size distribution 
of school districts, enrollments in relation to total population and to school-age popu- 
lation, and unit expenditures by state over several years. Limitations of the report 
in timing and in accuracy are related to the decentralized character of education in 
the United States. For state school system statistics the Office is piagued by depend- 
ence upon the voluntary cooperation of 50 some odd state agencies which in turn 
depend upon basic data from over a million teachers, bus drivers, custodians, and 
collation by over 50,000 local school districts of vastly varying size and administrative 
character which serve as intermediate units in the reporting chain. 

Commendable progress has been made in standardization of items and improving 
state systems of reporting. A handbook has been developed as a guide for uniform 
statistics in public elementary and secondary schools. Title X of the National De- 
fense Education Act is a stimulus to modernizing statistical operations in the States. 

To bridge the gap in the irreducible minimum of time lag between collection and 
dissemination the Office is issuing prepublication releases in the form of preliminary 
circulars and multilithed summaries. A brief survey each fall supplements the basic 
series with significant enrollment information for both public schools and higher edu- 
cational institutions. These data are disseminated during the early part of the cur- 
rent year. 

The adult education circular was accomplished by including questions in the 
Bureau of Census Current Population Survey for October 1957. It is indicative of 
many special statistical and research reports now being produced by the Office to 
supplement the basic statistics on education. It also suggests the more sophisticated 
methodology which is developing in the Office statistical programs. 

The four reports are but a part of an evolving pattern on educational statistics at 
the national level and a small segment of the current production of the Office. The 
reader should thus recognize that even though they together are representative, the 
complete available statistical story of American education is contained in many other 
products of the Office program. 


The Income of Nations. Paul Studenski. New York: New York University Press, 1958. 
Pp. xxii, 554. $25.00. 


Irvine Kravis, University of Pennsylvania 


TUDENSKI, Professor Emeritus of Economics at New York University, has produced 
S a work of extensive scholarship dealing with the history, theory and practice of 
national accounting. 

About the first third of the volume is given over to the history of the concept and 
estimates of national income with the great preponderance of attention devoted to 
the period before 1900.  » addition to the widely-known pioneering efforts of William 
Petty and Gregory King the author calls our attention to the work of many other 
writers in England, France, Italy, Germany, Russia, the United States and Australia. 
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Economists and statisticians share the stage with national income estimators drawn 
from many other professions including Lavosisier, the great French chemist, Lagrange 
the famous mathematician, Arthur Young, the great agricultural pioneer, and William 
Pitt, Prime Minister of Great Britain. Indeed, it was a former engineer, Timothy A. 
Coghlan, who was responsible for the preparation of the first official national income 
series ever published—for New South Wales beginning in 1886-87 and for seven 
British colonies in Australasia beginning in 1890. Studenski also sketches the histori- 
cal background in which the various contributors worked, particularly in the case 
of the Russians. 

The second major portion of the book is devoted to theoretical problems in national 
income accounting, estimating methods, and international comparisons. The pages 
dealing with theory touch upon the main issues and include a survey of the views of 
the leading writers in the development of national income concepts, but, possibly 
owing to limitations of space, the reader will have to look elsewhere for a close 
analysis of the pros and cons of the alternative positions described. With respect to 
international comparisons, some independent work on variations in the structure of 
production and income is essayed. The results while not without interest are limited 
by the amount of attention the author was able to give to the subject, and more in- 
tensive work has alrea‘ly been done by Kuznets and others, some of it undoubtedly 
appearing after Studenski had completed this section. 

Part Three of the volume contains reviews of the income estimates in 13 countries, 
including the Soviet Union, Yugoslavia, Germany, France, and India. Many of the 
country chapters, which average about a dozen pages in length, had the benefit of 
review by an expert engaged in the work described in the chapter. In Part Four the 
accounts of 66 other countries are given much briefer treatment. 


The very considerable value of this massive volume arises mainly from its extensive 
coverage of sources of national accounts data throughout the world. On this account 
alone it deserves a place in every library that caters to scholars working in areas re- 
lated to the growth of national income and to comparisons of the size and structure 
of national income from time to time or place to place. 


Statistics of Extremes. LZ. J. Gumbel. New York: Columbia University Press, 1958. Pp. xx, 
375. $15.00. 


G. J. LizBerMan, Stanford University 


tatistics of Extremes is the definitive work to date on this important subject. 
It is a complete treatise on the subject, bringing together many fundamental 
studies by scientists working in different fields and in many countries. 

The systematic treatment of extreme values starts with methods which are distri- 
bution-free. After some initial distributions have been considered, more general 
methods are developed, which hold for certain types of distributions; finally the 
asymptotic theory is constructed. 

The book comprises eight chapters. Each chapter consists of several sections 
divided into paragraphs. Each section starts with a statement of the problems. The 
first chapter introduces those statistical tools which will be frequently used, especially 
the concept of intensity function, taken from actuarial statistics, and return period, 
taken from engineering practices. 

The second chapter introduces distribution-free methods and culminates in the 
proof that a forecast of the number of exceedances of the extreme is more reliable 
than a similar forecast for the median. The third chapter shows the general properties 
of extremes which do not require the knowledge of the initial distribution; in particu- 
lar it sets bounds on the increase of mean extremes with sample size. 

The fourth chapter is devoted to exact distributions of the extremes. Here the initial 
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distribution, the parameters contained therein, and the number of observations must 
be known. Two types of initial unlimited distribution are worked out: the usual ex- 
ponential type, and the Cauchy type, which possesses a longer tail. Since the distri- 
butions of the extremes depend upon the behavior of the initial distribution at large 
absolute values of the variate, the two types possess different properties with respect 
to the extremes. A third type consists of certain limited distributions. 

As long as small samples are considered, no new parameters enter into the distri- 
bution of extremes. This situation changes as soon as large samples are approximated 
by the three asymptotic theories which are studied in chapters 5 to 7. Here two new 
parameters appear, which are connected with the intensity function and return 
period. 

The first asymptotic distribution of extreme values valid for the exponential type 
of initial distributions seems to be the most important. chapter 5 is devoted to the 
study of this theory. Applications especially to floods are given in chapter 6. The 
second and third asymptotic distributions valid for the Cauchy type, and for certain 
limited distributions, are shown in chapter 7. Certain empirical formulae derived by 
engineers for the breaking strength of materials are special cases or generalizations 
of the third asymptotic distribution of the extremes. 

The last chapter deals with functions of extremes, their sum, difference, quotient, 
and geometric mean. 

The book contains many detailed examples of the use of extreme values in such 
diverse fields as flood control, dam, bridge, reservoir, and hydroelectric plant con- 
struction, mortality tables, metal fatigue and building codes. 

The book will be of “extreme value” to statistically minded people who are inter- 
ested in the theory of extremes. However, it is possible that the book will not reach 
many scientists who are interested in the subject matter, but are not well versed in 
statistics. This is unfortunate, since the book is a treatise on the subject and contains 
material of vital interest to such people. Dr. Gumbel is to be commended on bringing 
together all the important work in such an important field. 


'Intraclass Correlation and the Analysis of Variance. Ernest A. Haggard. New York: 
The Dryden Press, 1958. Pp. xx, 171. 


SitGREAVES, Teachers College, Columbia University 


HE intraclass correlation coefficient was first introduced to this reviewer some 
5 pte ago as a measure of the variation of mean values from class to class in one- 
way classification problems of Model II of the Analysis of Variance. Alternatively, 
for these problems, it is the usual product moment correlation between any pair of 
values within a class. For this case, the population value of the intraclass correlation 
coefficient is defined by 


Oa? 
oa" + ov? 

where o2 is the component of the variance due to the classificatory variable and 02, 

is the variance of measurements within each class. If we have m classes with k obser- 

’ vations in each class, the mean square for between classes S?, say, is an unbiased esti- 

mate of (ko2+02), while the mean square for within classes S2, say, is an unbiased esti- 

mate of o3. The usual estimate for R, therefore, is 


S2 S,? 


S? + (k —1)S.*- 


The present treatise traces the history and logic of the statistic R, beginning with 
Karl Pearson’s estimate of the correlation between the heights of brothers. A dis- 
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cussion of the computation and properties of the statistic for one-way classification 
problems is given without specific formulation of a probability model, and a compara- 
ble statistic is defined for two-way classification problems. The various analysis of 
variance models are then presented, and the statistic is discussed in the framework 
of these models. Finally, applications are given to reliability studies, to pattern 
analytic studies, and to data in the form of ranks. 

The book is difficult to read because it is poorly organized and written, particularly 
from a statistical point of view. The exposition is diffuse and wordy, and frequently so 
many details are presented that proper emphasis is lost. This seems especially true 
in the historical development, and in the discussion of the analysis of variance models. 

A more serious fault is the fact that the author tends to use statistical terms loosely 
and without proper definition. Thus, the term “P-value” is applied generally to a sta- 
tistic, rather than to a specified critical region for the statistic. In a number of in- 
stances, statements are made which at first reading seem in error. In some cases, the 
statements can be justified if the reader is willing to spend time translating what the 
author says into what presumably he means. In other cases, such translations remain 
in doubt. One such case is that of the confidence interval given for R on page 23. (In- 
cidentally, it is not usual to speak of confidence limits of a statistic, but rather of con- 
fidence limits for a parameter.) The given confidence limits seem to be based on the 
F-test in which the test statistic is formed by putting the larger mean square in the 
numerator, but it is not clear that this leads to a meaningful confidence interval. The 
usual two-sided confidence interval for R with confidence coefficient (1—a), based on 
an observed F-ratio F» in which the between classes mean square is always in the nu- 
merator, is defined by 


F.(i — R) 


Fya(mi, m2) < DE Fi-4a(m, nz) 


where 
1 


F ™) 


is the (1/2)ath percentile of the F distribution with n, and nz degrees of freedom for 
the numerator and denominator, respectively. 

It seems to this reviewer that the organization of the book would have been im- 
measurably improved if in the early chapters, the author had formulated and dis- 
cussed separately a series of probability models for situations in which the parameter 
of interest is the population product moment correlation between pairs of observa- 
tions within a class. The present development devotes the first four chapters to 
sample statistics with only glancing (and confused) references to population values, 
while in the fifth chapter these statistics are simultaneously embedded in the frame- 
work of all current analysis of variance models, not all of which are meaningful for the 
basic problem involved. Among other things, formulation of a specific probability 
model would clarify the meaning of a negative population correlation which does not 
arise naturally in the analysis of variance models. Further, in terms of a model, the 
population correlation between pairs of observations within a class is the usual bi- 
variate product moment correlation between two variables with the same means and 
variances. It is easy to show mathematically that the appropriate estimate for this 
correlation is the sample intraclass correlation coefficient. In this form, no question of 
the use of any sample product moment correlation coefficient arises. 

Because of the many ambiguities involved, the book seems of doubtful value. It 
is useful, perhaps, in directing attention to the notion of the intraclass correlation 
coefficient and to possible areas of application. In any specific problem, however, the 
statistician will need to carry through his own justification for the procedure he adopts. 


Pya(m, m2) = 
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Financing of Economic Activity in Canada. Wm. C. Hood, including “A Presentation of 
National Transactions Accounts for Canada,” 1946-54, by L. M. Read, S. J. Handfield- 
Jones and F. W. Emmerson. Ottawa. Royal Commission on Canada’s Economic Prospects, 
July 30, 1958. Pp. xv, 699. 


E. F. Beacu, McGill University 


HIs is one of some thirty odd studies done for and published by the Canadian 

Royal Commission on Canada’s Economic Prospects which is generally known as 
the Gordon Commission after the name of its chairman. These studies will probably 
have more over all impact than the deliberations and recommendations of the Com- 
mission itself, and this would have been true even if there had not been a change in 
government. 

The work of the Commission has given a tremendous lift to economic research in 
Canada. This result was achieved partly by the way in which the Commission ar- 
ranged for the studies. In addition to the substantial collection of experts in Ottawa, 
many of the studies were farmed out to economists and statisticians in the employ 
of private business throughout the country. This may have been in the interest of 
economy because the number of studies is so large; but it had an additional effect of 
involving a large number of the business organizations in the country. Of those econ- 
omists who were not directly employed in this way, many were employed in preparing 
briefs to be presented to the Commission. Such widespread activity has made 
Canadian businessmen and Government personnel aware of the need for such study 
and more idea of what it can accomplish. Another result is that there is a very sub- 
stantial quantity of published material available on Canada and its problems. Since 
the market for economic literature is not extensive in a country of some seventeen 
million people with two languages such royal commissions play a very important part 
in producing this extensive and very useful material. The Gordon Commission is 
very similar in this respect to the Royal Commission on Dominion-Provincial Finan- 
cial Relations, the well known Rowell-Sirois Commission of twenty years before. 
Some of this literature is of especial importance. The economic history of Canada 
written for the earlier Commission by Dr. Mackintosh has been much used since. 
For the more recent Commission the study of Output, Labour and Capital by 
Hood and Scott is outstanding. 

The Rowell-Sirois Commission was instrumental in setting up our first comprehen- 
sive estimates of national accounts which were subsequently taken over by the Bureau 
of Statistics and made a regular part of the statistical work of that Bureau. The 
Gordon Commission, has given us our first National Transactions Accounts which 
are published as the last third of the book under review. The guiding spirit in this 
task was, of course, Professor Hood. Space does not permit special appreciation of 
all those who played a part, but there was very active cooperation with the Bureau of 
Statistics and the Bank of Canada. It is a notable achievement, and we may presume 
that these Accounts will now form a part of the regular reports of the Bureau. 

Even without the National Transactions Accounts in this volume, the material 
which Professor Hood presents in the first two thirds would make it one of the most 
important of the many studies undertaken for the Commission. Here is a very able 
attempt to study all aspects of capital aspects in Canada. But the job is too big. To do 
it justice Professor Hood should have devoted himself entirely to this task rather than 
to the many other things he did for the Commission. As it was, the actual writing 
was done under pressure after he was back lecturing, and a number of areas, such as 
housing mortgages had to be omitted entirely. The net result however is a very useful 
book which lays out ground that very much needed study, and the stage is now set 
for any number of graduate thesis studies over the years. Even as it is, the book 
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presents much useful and interesting material not found elsewhere, and is already 
being used in university classes. Some of the most interesting material lies in the 
concluding chapter. 

Hood has attempted to explore the various capital markets of the country, both 
short and long run, to evaluate their inter-relations and particularly the efficiency of 
their functioning. How well do they allocate the capital among the various needs; 
what are the obstacles to such efficient functioning; how responsive are demanders 
and suppliers to interest rate and other changes? 

In attempting to obtain answers he has faced serious gaps in our statistical infor- 
mation and has made use of United States data. He has also made extensive use of 
the United States literature. He has formulated some tentative findings, but these 
are perhaps less significant than the exploratory work he has done. The summaries 
at the ends of the chapters help to find one’s way through such a comprehensive 
treatise. There is, unfortunarely, no index. The detailed table of Contents is no sub- 
stitute, and is needed in any case. It should be said that the material published here 
should have been published as two books. The Transactions Accounts are an achieve- 
mer worthy of publication as a book and much more easily found and used that way. 
The capital studies of the early part of the book under review do make use of these 
figures, but this alone is not an adequate justification for producing such a large tome. 


The Logic of Investment Planning. S. Chakravarty. Amsterdam: North-Holland Pub- 
lishing Co., 1959. Pp. xii, 170. $4.00. 


Houuis B. Cuenery, Stanford University 


His is the eighteenth volume in the distinguished series, Contributions to Economic 

Analysis. Like most of the earlier volumes, its purpose is to explore the usefulness 
of economic models as a guide to economic policy—in this case, in the planning of 
investment. Although the analysis is entirely formal, Chakravarty aims at providing 
some guidance to practical planners. 

In the first five chapters, Chakravarty surveys a range of familiar growth models 
with particular attention to dynamic inter-sector systems derived from von Neumann 
and Leontief. Although Chakravarty stresses the importance of using an explicitly 
dynamic model for investment planning rather than the usual single-period projection 
for five years, his analysis does not shed much light on the nature of the differences to 
be expected in the results since it stops with the algebraic formulation. 

It is only when Chakravarty abandons the assumption of a closed economy in 
chapter VI that his models approach the realistic context of development planning. 
Here he is particularly interested in the question of import substitution, which he 
calls “the structural break.” His treatment of this and other problems of choice, 
which should be a central feature of the book, is disappointing. Chakravarty shows 
little familiarity with the large body of literature on investment priorities and makes 
no attempt to integrate the conclusions of partial analysis with his interindustry 
model. He seems content to show that if a certain number of variables is fixed “on 
policy grounds” the remainder will be determined from the model. The basis for 
choice among alternative investments is covered in three pages with a brief reference 
to Tinbergen’s Design of Development, and the analysis of a comparative advantage is 
also very incomplete. The emphasis is again on the existence of determinate solutions 
rather than on the logic of choice. A recognition that linear programming is the 
logical tool of analysis for this problem would have clarified the discussion greatly. 

Although I am very sympathetic to Chakravarty’s objective of trying to treat in- 
vestment planning in a more systematic way, I can only conclude that he has fallen 
far short of his goal. Mathematical aspects of the various models are stressed to the 
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exclusion of their economic rationale, and crucial assumptions are introduced with 
little regard for their economic significance. Empirical aspects of the models are com- 
pletely ignored, so that there is no basis for distinguishing between important and 
irrelevant refinements in the model. The reader interested in the practical possibilities 
of interindustry models for investment planning is likely to get many more insights 
from the theoretically cruder methods of the Economic Commission for Latin America, 
for example, where models are discussed against a background of actual country 
studies. Although Chakravarty is a disciple of Professor Tinbergen, his main weak- 
ness is precisely in the area of Tinbergen’s main strength: the judicious mixture of 
economic judgment with formal analysis. 


The Labor Force Under Changing Income And Employment. Clarence D. Long. Princeton: 
National Bureau of Economic Research, General Series No. 65; 1958. Pp. xxiv, 440. $10.00. 


M. W. Reper, Stanford University 


HIs is a landmark in empirical studies of labor force behavior. That it will not 
‘wan altogether fresh to students of the subject is due in good part to the fact 
that, in mimeographed form, it has been widely circulated for several years, and has 
already had an appreciable impact upon researchers in the field. Nonetheless it will 
probably be many years before it is displaced as “the” book in its field. 

The author is concerned with describing and explaining the factors that determine 
the percentage of a population that enters the labor force. As a purely statistical com- 
pilation, it is a fine telephone book; there are lots of good numbers. But it is more than 
a telephone book; Long makes a serious effort to explain his findings. His analyses 
are of two principal kinds: (1) Comparisons of labor force participation rates at dif- 
ferent points of time in the same “place” and (2) simultaneous comparisons of these 
rates in different places. In both types of analysis, the principal relationship investi- 
gated is that of labor force participation rate, 1, and income, with unemployment, 
length of work week, education, etc. entering as secondary influences upon I. 

As l varies considerably among races, sexes, age groups, rural and urban areas, etc., 
analysis of the relation of 1 with income and other variables of economic interest 
requires elimination of these sources of disturbance. This could have been done by 
fitting a multiple regression of | on these nuisance variables, income, etc., and study- 
ing the various partial regressions. However, Long chose to follow (and improve upon) 
Douglas’ technique of standardizing labor force composition and then analyze the 
standardized data. Though it would be instructive to compare the results from the 
two techniques, I doubt that it would have affected many of Long’s conclusions. 

From an analytical point of view the more important of these conclusions concern 
the variations of | within particular age-sex-race categories. It is impossible in a 
brief review to consider these in any detail; there are a great many of them and more 
than a few are quite involved. Fortunately, the first chapter contains a detailed, 
chapter by chapter, summary of the study’s empirical findings from which the 
marginally interested reader can decide whether and where to proceed further. 

For the most part, Long considers his study to be of the supply side of the labor 
market. He concludes (p. 29) that “under comparatively static conditions, the higher 
the income the lower the participation, with rather great changes in income required 
to bring about moderate changes in the over-all participation rate.” Given popu- 
lation quantity and composition, labor supply is a diminishing function of per capita 
income with a low elasticity. To the obvious query, how do you know you are not 
observing a demand relation intersected by a shifting supply relation, Long offers 
several rather brief and rather cryptic answers (pp. 29-30). Frankly, the reviewer 
feels that more discussion and investigation of this issue is needed. 


j 
fi 


BOOK REVIEWS 389 


Long’s contention that his relationships are those of supply seems to be confined 
to long-run and/or cross-section analyses; t.e., inter-area comparisons and secular 
movements. For short-period movements, it is contended that | does not respond to 
the quantity of employment opportunities available (p. 30); 7.e., that variations in 
unemployment and related phenomena are primarily caused by fluctuations in de- 
mand acting upon a highly inelastic supply. 

As an account of aggregate labor force behavior, this is plausible—at least as a 
first approximation. The problem is to reconcile this aggregative model with the 
analyses of the various labor force strata. Long does not pretend that the supply 
functions of these strata remain unchanged over time; e.g., he explains the secular 
trend in / for females as due partly to reduction in time needed for household duties. 
and partly as the result of improved education. Both factors cause a downward shift 
ir the supply curve of this type of labor and the latter has differential supply effects in 
the white-collar and blue collar labor markets. Therefore what happens to / and to 
earnings in these markets will partly reflect the character of (and shifts in) the de- 
mand function. Now it is possible that an unchanging aggregate supply function can 
result from a set of inter-related and shifting sectoral supply functions; by implica- 
tion, Long asserts that this is what has happened. But it will take far more analysis, 
especially of the behavior of labor demand, before this conclusion can be fully 
accepted. 

On the descriptive level, the book is excellent. (The overlap of this book with 
Gertrude Bancroft’s monograph, “The American Labor Force,” is very slight.) The 
detailed descriptions of short-run labor force behavior in deep depressions (chap. 
10) and in war-time (chap. 11), especially the inter-county comparisons, will be 
standard references for many years. This statement also applies to the description 


of over-all secular constancy of | resulting from off-setting variations in the participa- 
tion rates of different labor force components (Chapter 12). 

In short, this book is both a summary of our empirical knowle“ ze of labor force 
behavior, and a starting point for further analytical work. Anyone interested in the 
subject must read it. 


The World’s Metropolitan Areas. International Urban Research. Berkeley and Los 
Angeles: University of California Press, 1959. Pp. vii, 115. $3.00. 


Henry S. Suryock, Jr., U. S. Bureau of the Census 


I TuHIs little book, International Urban Research of the University of California, 
under the direction of Kingsley Davis, has tried to increase the comparability of 
data on urban communities by demarcating the boundaries of the world’s metropoli- 
tan areas. The first third of the book consists of a discussion of concepts and a de- 
scription and justification of the procedures actually followed. The remainder is 
given over to photolithographed tables showing the population of individual areas 
and their central cities and to source notes and data on the political units comprising 
each area. 

The authors believe that the best delineation of a large urban community would 
take the form of an urbanized area or conurbation. Unfortunately, such a delineation 
would cut across the administrative units which are usually the smallest areas recog- 
nized in official statistics and could be carried out only if arranged for at the time 
of the census enumeration. Accordingly, the present project was forced to make use 
of “Metropolitan Areas” (M.A.’s), which may admittedly contain some rural territory 
but which have advantages in the form of the availability of statistics on various sub- 
jects. An “M.A.” is defined as “An area with 100,000 or more inhabitants, containing 
at least one city (or continuous urban area) with 50,000 or more inhabitants and 


L] 


390 AMERICAN STATISTICAL ASSOCIATION JOURNAL, JUNE 1960 


those administrative divisions contiguous to the city (or to the continuous urban area) 
which meet certain requirements as to metropolitan character.” 

To those familiar with recent developments in the United States in this field, the 
parallels in terms of both choice of concepts and of definitions are striking. Even 
though the United States Census does publish figures for urbanized areas, primary 
emphasis in publications has been given to metropolitan areas—essentially for the 
reasons stated here. Moreover, this study began with the official U. S. definition of a 
Standard Metropolitan Area and modified it as needed to suit differences in urban 
development abroad and in statistical and administrative practices. For example, 
since data on worker commuting were available for only a few countries, more reliance 
was placed upon density gradients. 

Of the 1,046 M.A.’s established, data for only the central city could be obtained 
for 31 per cent, most of which are in Communist countries. Nonetheless, the table 
represents a useful advance over the conventional compilations of city populations 
in such compendia as the Demographic Yearbook of the United Nations. Since the 
goal here is international comparability, it does seem surprising that the authors do 
not explicitly recognize the phenomenon of the international metropolitan area. Two 
of them (Berlin and Jerusalem) unobtrusively make an appearance, however. 


Manpower and Innovation in American Industry. Samuel E. Hill and Frederick Harbison. 
Princeton: Industrial Relations Section, Princeton University, 1959. Pp. vii, 85. $2.00. 
Paper. 


CLARENCE D. Lona, Johns Hopkins University 


or half a century, this country has been investing enormous sums in education. 

Yet most of that time, economists have concentrated attention on the quantity 
of labor and tangible capital as the source of economic progress. Business also has 
been preoccupied with problems of unskilled and semi-skilled labor as a result of the 
rising power of mass unions; and the lack of concern with quality has been reflected 
in neglect of the highly skilled worker and in the lagging remuneration of scientists, 
expert staff, and middle management. Only since World War II—with the develop- 
ments in chemistry and electronics, and the release of new orders of magnitude of 
energy—have economists and business men begun to realize that the great stream of 
innovation has its real source in the human mind. This little monograph of Professors 
Hill and Harbison is a welcome addition to the growing literature on the quality of 
the labor supply. 

The authors begin with a few figures showing the tremendous growth of the higher- 
talent worker as a proportion of the labor force. In the ten years, 1947-57, employ- 
ment of manufacturing production workers rose by 130,000, or 1%, while that of 
non-production workers rose by 1,380,000, or 55%. Is this growth temporary or 
permanent? What are its underlying causes? Does it represent the working out of 
Parkinson’s Law: that the industry bureaucrat, like his government counterpart, is 
mainly concerned with multiplying work for himself and his subordinates? Is it due 
to the ever-heavier burden of government reports? Or is it a concomitant of the 
greater complexity of larger-size firms, industries and communities? Since the official, 
aggregative, statistics are too crude for answering these questions, the authors turn 
to individual firms, giving special attention to the years 1947 through 1955. The 
number of firms is naturally limited because of the paucity of satisfactory records 
and the heavy burden of interviewing. In the end, the study relied on fifty firms 
scattered through eighteen two-digit industry groups, of which eleven were manu- 
facturing and the rest construction, railroads, utilities, communications, trade and 
banking. Five industries were represented by one firm, five others by two, only one 
industry by as many as seven. 
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The method was to find out, from company records, what changes had occurred 
in the proportions of executive, administrative, professional, technical, sales and 
clerical personnel; then to ask what developments in company policies and behavior 
had caused them. Incidentally, as the study progressed, the authors became impressed 
with how little the companies had previously known about the changes. 

The majority of firms experienced increases in the proportions of these higher-level 
employees, the greatest increases taking place among the technical and professional 
personnel. Some firms, however, experienced no change, and even decreases, for one 
or the other of the higher-level categories. 

Attention was given to ascertaining the causes of the increases. A particularly use- 
ful device was to compare the histories of two companies in the same industry: one 
company in which extensive innovation took place, the other in which innovation 
was minor. This comparison was made for two textile firms, two machinery firms, two 
electric power firms, and two natural gas transmission firms. Studies were also made, 
within the same company, between plants with marked changes in product, output, or 
process, and plants with little change. This was done for paired plants in electrical 
equipment manufacturing, in a steel company, and in a chemical company. Third, 
comparison was made of shifts in the occupational structure of eight firms in different 
industries. 

The conclusions of the study were— 

1. The chief cause of the increase in the percentage of highly-skilled personnel was 
innovation. 

2. The rate of increase was governed by the rate of innovation. 

3. The biggest increase in percentage of highly-skilled personnel was among com- — 
panies that did their own innovating, the smallest among those that bought their 
improved equipment from other firms or employed outside consultants. 

4. Innovation of one firm tended to force innovation of others, through competition. 

5. Mere size and empire-building were relatively minor factors, and government 
reporting burdens were important only for government-regulated industries. 

From these findings, the authors draw some inferences concerning future trends. 
A major inference is that substitution of high-level labor is likely to increase in the 
decades ahead, leading to increases in overhead costs, decreases in the proportions 
of workers in unions (unless unions do a better job of organizing white collar workers), 
and stiffer competition among firms in hiring engineers, creative scientists, managers, 
and administrative staff. These future trends will call for drastic changes in personnel 
policies: in the direction of basing pay on individual capabilities rather than on job 
descriptions, improving work incentives, developing executives with the brains and 
vision needed to preside over high-quality subordinates, and greater emphasis on 
forward planning of manpower. 

This is a timely and stimulating little book. A possible criticism is that it rests 
many, rather bold, conclusions and inferences on a small sample of firms and on a 
rather casual examination of that sample. But the approach is refreshing and at 
times ingenious. Professors Hill and Harbison deserve praise for blazing at least a 
faint trail toward a large and promising wilderness of problems and ideas. It is a trail 
over which future empirical investigation can bring up heavier statistical equipment, 
to carry out a more thorough exploration. 


Automation and Computing. Andrew D. Booth. New York: The Macmillan Co., 1959. 
Pp. 158. $5.00. 


Grorce E. Forsytue, Stanford University 


N SPITE of the great importance of the Computer Revolution in the world today, 
there are comparatively few books on the subject written for the non-specialist. 
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The present book is one of the few, and is a readable account for the university- 
educated person who would like to gain some impressions of the field. It would help 
the reader considerably to have majored in science or mathematics. 

The author has been active in digital computing in England for over a decade, and 
he and K. H. V. Booth have separately and together written several expositions of 
numerical methods, computer programming, and automatic computers. The author 
is now head of a Department of Numerical Automation at the University of London. 

The book covers a vast area—from the history of automation, via the engineering 
of digital computers and automatic factories, to playing chess on a computer. Obvi- 
ously only the surface could be scratched in 157 pages, and yet the author does 
succeed in conveying the flavor of the new technology. Somehow, he failed to cover 
Statistics. There are some three dozen references to other sources on the subjects 
covered. 

The author has strong opinions on many topics. He tends to play up his own con- 
tributions to computer technology, and to play down those of others. It is startling 
to find an Englishman coming out four-square against democracy (p. 20). 

The chapter titles are as follows: 

I. History of Automation and of Analogue and Digital Calculation 
II. The Logical Design of Digital Calculating Machines 
. The Physical Basis of Computer Design 
. Programming for a Digital Computer 
. Analogue Computers 
. Automation in Clerical Work 
. The Control of Continuous Processes 
. Automatic Machine Tools and Assembly Processes 
. Strategic and Economic Planning 
. Non-numerical Applications of Computing Machines 


Proceedings of the Fifth Annual Computer Applications Symposium. Armour Research 
Foundation. Chicago: Armour Research Foundation, 1959. Pp. x, 153. $3.00. Paper. 


B. E. Exutson, Lockheed Aircraft Corp. 


HIs volume contains the papers presented at a two-day symposium in Chicago, 
October 29 and 30, 1958. 

The first day of the symposium was devoted to papers discussing business and man- 
agement applications of computers: (1) “Operations Research and the Automation 
of Banking Procedures,” by R. A. Byerly, discusses a study the objective of which 
is “the development of criteria for use by individual banks in optimizing their opera- 
tions, considering size and other characteristics.” (2) “Information Systems Moderni- 
zation in the Air Materiel Command,” by D. E. Ellett, reviews the program for “the 
modernization of the Materiel Information Service through the use of electronic com- 
puters (Univae 1105, IBM 709) and advanced communications.” (3) “Utilization of 
Computers for Information Retrieval,” by A. Opler, defines “information-retrieval,” 
and enumerates steps in the process. Characteristics of areas best suited to electronic 
computer retrieval application are listed. A number of information-retrieval systems 
are described briefly (IBM 650, 701, 702, 704, 705, Bendix G-15D, Datatron, SEAC, 
Univac, 305 RAMAC, and special-purpose computers). (4) “Problems and Prospects 
of Data-Processing for Defense,” by C. A. Phillips, (Luncheon Address), describes 
“efforts in the Department of Defense to develop long-range programs and policy 
guidance and management concepts for an over-all data-processing program.” (5) “An 
Integrated Data-Processing System with Remote Input and Output,” by R. D. 
Whisler, describes an approach to the purchase of an electronic computer (NCR 304), 
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and plans for order-processing with a computer and a private-wire communication 
system. A number of other operations scheduled for programming on the computer 
are listed. (6) “The Role of Character-Recognition Devices in Data-Processing Sys- 
tems,” by R. A. Harrell, describes several character-recognition devices, explains the 
principles upon which they are based, and mentions some applications. Printed-code 
sensing and automatic character-recognition are distinguished. Optical and magnetic 
character reading are compared. (7) “Input-Output—Key or Bottleneck?” by R. D. 
Elbourn, argues that “the bottlenecks of input-output have not been so much in 
speed alone as in logical flexibility and in variety of medium and format,” discusses 
the elimination of bottlenecks by new techniques, and concludes that developments 
leading to the removal of bottlenecks are the keys to a host of new computer appli- 
cations. 

The second day of the symposium was devoted to papers discussing engineering and 
scientific applications of computers: (8) “Scientific Uses of a Medium-Scale Computer 
with Extensive Accessory Features,” by R. A. Haertle, discusses the operation of a 
Basic IBM 650 Computer with accessory features; describes its use in a ballistic 
missile simulation computer program and in reduction of data obtained during 
missile-flight tests, and indicates other tasks which it has performed. (9) “The Design 
of Optimum Systems,” by R. R. Brown, defines and discusses four methods, adapted 
for the computer, of obtaining a “sub optimum” of the system design corresponding 
to a relative minimum for a non-linear programminog problem: (a) the method of 
steepest descent; (b) the optimum gradient method; (c) Newton’s method; (d) the 
accelerated optimum gradient method. (10) “Computer Applications in the Numerical 
Control of Machine Tools,” by R. B. Clegg, defines “numerical control,” classifies 
numerically controlled machine tools, gives examples and illustrations of numerically 
conirolled machine tools, outlines some numerical control systems, and forecasts some 
future developments. (11) “Frontiers in Computer Technology,” by R. W. Hamming, 
(Luncheon Address), discusses a personal vision of a computer that lies in the future, 
the hardware, the method of using it, and the class of problems on which it is to be 
used. (12) “Computer Sharing by a Group of Consulting Engineering Firms,” by 
E. M. Chastain and J. C. McCall, presents reasons for the establishment of the Mid- 
west Computer Service, Inc., outlines the feasibility study and preliminary planning, 
describes the organization of the computer service, the equipment (Bendix G-15D) 
and the operation of the service, and gives an evaluation of the organization. (13) 
“Current Developments in Computer Programming Techniques,” by F. Way III, out- 
lines plans for a multistage process in translating from the pseudo-code (problem- 
oriented language) to the machine language, and describes what has already been 
done in automatic programming at the Case Institute of Technology (IBM 650) 
Univac I). (14) “The Future of Automatic Programming,” by W. F. Bauer, presents 
a “survey of the automatic programming frontiers.” The following four areas are 
discussed : (a) multilevel language structures; (b) language development, standardiza- 
tion, and translation; (c) automatic computer operating systems; (d) modern com- 
puter developments. 


Report on the 1950 World Census of Agriculture, Volume II, Census Methodology. 
United Nations, Food and Agriculture Organization. Rome: United Nations, 1958. Pp. ix, 
167. $2.00. Paper. 


D. Gaus Jounson, University of Chicago 


_—_ 1950 World Census of Agriculture was the third effort to obtain reasonably 
comparable census data on agriculture throughout the world. The agricultural area 
covered by countries which participated in the program included less than sixty 
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per cent of the total agricultural area of the world. If China and the U.S.S.R. are 
excluded, the included area was seventy-one per cent of the total. Cooperation was 
nearly complete in the Americas and Oceania, but quite limited in Asia and Africa. 

The present volume, which is the second of three, describes the coverages of the 
various national censuses. By broad areas, such as land, utilization, it indicates the 
particular coverage, if any, for each cooperating nation. For the statistician and 
economist, the most valuable contribution of the volume is its availability as a ready 
reference source to determine the specific countries that have reasonably comparable 
data on a large number of items. 

Brief reference to the other two volumes seems appropriate. Volume I, which is 
being published in loose-leaf form, includes the actual census results for each co- 
operating country. Volume III will comprise an analysis of the main subjects included 
in the census. 


Handbook of Area Sampling. John Monroe and A. L. Finkner. New York: Chilton Com- 
pany—Book Division, 1959. Pp. vii, 55. $3.00. 


Wituram G. Mavow, Stanford Research Institute 


HIs small volume consists of four chapters and an appendix. In the chapters, the 
Reases give the steps to be taken in preparing materials to be used in selecting 
an area sample, the steps to take in selecting the sample, and some of the standard 
sampling designs. The appendix contains census definitions, sources of housing data 
and aerial photographs, and a table of random numbers. 

The book is well-written, concise and should be very helpful. 

The greatest usefulness of this book will probably be to those who often make 
sample surveys, and to those who teach courses in sample surveys. The book should 
especially be useful as a brief practical procedures supplement to courses in sampling 
theory. 

It would be a little dangerous to try to use this book without a sampling back- 
ground because one can rarely follow the rules exactly, and small deviations from 
rules often produce large costs in the accuracy of information obtained from the 
survey. 


**Student’s” Collected Papers. LE. S. Pearson and John Wishart, Editors. Foreword by 
Launce McMullen. Cambridge: Cambridge University Press, 1958. Pp. vii, 224. 21S. 


HE complete collected papers of “Student” (W. 8S. Gosset), out of print for some 
time, have been reissued without change. 
D.L.W. 


Readings in Linear Programming. S. Vajda. New York: Wiley, 1958. Pp. vii, 99. Price 
unlisted. 


LioneL McKenziz, University of Rochester 


HIs is an unusual book of “readings.” It is not composed of reprints of articles, 
but of short chapters, most of which are closely dependent on one or more pub- 
lished papers. However, the chapters present this material differently from the original 
papers. They are usually much briefer and the argument is simpler. Theoretical dis- 
cussions and mathematical theorems are suppressed, while arithmetic illustrations of 
the methods furnish most of the matter. Many of the twenty-four chapters occupy 
two or three pages and present their messages economically. 
The range of subject matter might be termed classical linear programming, that is, 
the transportation problem, with shadow prices, the original simplex method, the 
method of network flow, and the use of the dual problem. Thus less fundamental de- 
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velopments, which are not so easily presented by arithmetic, are omitted, for example, 
the revised simplex algorithm and the dual simplex algorithm. 

A major emphasis lies on the preparation of problems so that they are amenable 
to linear programming. The problems include many of those which have been out- 
standing in the literature, among the general applications, production scheduling, 
personnel assignment, and warehousing, and among some less general applications, 
the caterer problem, blending of gasolines, and trim loss reduction. The earliest 
examples of linear programming, transportation, nutrition, and the airlift are there. 
There are a few theoretical chapters on the simplex method, flow through a network, 
and duality. The final chapter applies linear programming to games. 

An attempt to avoid algebra and theory in linear programming must encounter 
some difficulties. Often the arithmetical steps are not easily followed or the motivation 
is unclear. These difficulties are accentuated by the brevity of the discussions. If a 
book is too short, it takes long to read. Possibly this book should have been less 
short. The symbols used are often not defined, and obscure steps in the argument are 
not explained. If more theory had been given the argument would be more trans- 
parent to those capable of appreciating the theory and it can be doubted that others 
will have the patience to read the book anyway. 

A particular complaint that I would like to make is the absence of economic con- 
cepts. Maximizing or minimizing an objective function which has practical interest 
is, after all, economics. We have in the popular mind of our culture a wealth of eco- 
nomic common sense which it is foolish to ignore, as it would be foolish to ignore com- 
mon sense notions in mechanics or optics. If the simple idea of avoiding unprofitable 
activities is exploited in conjunction with imputed prices, much of linear programming 
becomes intuitive to common sense, and the structure of the algorithms is easily 
grasped. Moreover, the economist has a suspicion that the failure to make use of 
economic common sense has retarded the development of the subject. Certainly the 
recent innovations, the methods which combine primal and dual algorithms and the 
threshold method can be rationalized by the economic interpretation. 

The value of economic interpretation can be seen in the application of the method 
of network flow to the transportation problem. The point of this approach is to select 
a subset of transportation routes (activities) and select shadow prices on goods at 
plants and warehouses so that the cost of the good at the plant plus the cost of trans- 
port to the warehouse is just met by the price of the good at the warehouse for the 
routes chosen, while other routes show losses, or at least no profits, on shipments. 
Then the flow over the chosen routes is made as large as possible. If the whole pro- 
gram of shipments is achieved, the cheapest program has been found. If there are 
surpluses at plants, and thus deficiencies at warehouses, we do what economic com- 
mon sense suggests. Raise the prices at warehouses with deficiencies and lower them 
at plants with surpluses until at least one new route can defray its costs. If the in- 
creases and decreases are equal, all the routes formerly used are still available. Thus 
total shipments are not reduced. This is continued until additional shipments cause 
the surpluses and deficiencies to disappear. 

Economists have known for a long time that what is produced by (linear) activities 
whose revenues equaled costs, when other (linear) activities do not achieve an excess 
of revenue over cost, is produced efficiently. Thus cost must be minimized in such cir- 
cumstances (or revenue maximized in the converse problem). The only question that 
arises then is one of feasibility. A similar procedure is now available for the general 
linear programming problem in the primal-dual algorithms. Of course, the original 
simplex algorithm may also be rationalized by an economic argument. However, this 
method is more primitive from the economic viewpoint. 
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The movement toward economics has been completed in the threshold method. 
For example, in the transportation problem, the warehouses are endowed with de- 
mand functions causing them to spread their purchases over the plants, preferring 
plants who supply at lower prices. Then the prices initially set at the plants are in- 
creased where demand exceeds supply and lowered where the opposite holds until 
equality is achieved. If the demand functions are properly designed, and adjustments 
are not too large, the iteration is convergent. Economists have studied the con- 
vergence of market processes, so even this question is not foreign to the economic 
literature. Next the demand functions are refined to reduce the spread of purchases 
until a satisfactory approximation to the minimum cost allocation is attained. The 
virtue of this method is its ability to handle very large scale problems. 

It is striking that the development of good computational algorithms has paralleled 
the approach to a market analogue. It seems to justify the use of economic common 
sense also in elementary expositions. 

Some apparent misprints: p. 11, 1. 17, “=="?; p. 13, 1. 19, x’s omitted?; p. 13, 1. 6 up, 
“B* for p. 14,1 §, 2{, omitted in first equation and equation for primed 
variables omitted; p. 67, 1. 12, “1” should be “2”; p. 67, 1. 21, “y:” should be “y,”; 
p. 71, tables “5” should be “1” in first column, second row; p. 81, 1. 2 up, “third” 
should be “first”; p. 86, table, “12” in fourth row, fourth column, not in fourth row, 
sixth column. 


Economics of Mental Illness. Joint Commission on Mental Illness and Health Mono- 
graph Series No. 2. Rashi Fein. New York: Basic Books, Inc., 1958, Pp. xx, 164. $3.00. 


Mark 8. Biumsere, Stanford Research Institute 


HIs is the second volume of a projected ten-volume series to be published by the 
Joint Commission on Mental Illness and Health. The research was supported by 
the generous U. S. Mental Health Study Act of 1955. 

The first chapter is a brief introduction. For this reviewer the high point of the book 
was chapter II entitled “The Concept of Costs.” It is a 15-page essay on the dif- 
ficulties of establishing a useful and acceptable set of criteria to determine the eco- 
nomic cost of any disease. Many of the assumptions presented in these pages are 
debatable but their presentation is valuable to stimulate thought on the subject. The 
ideas of Malzberg, Reynolds (of England), Dublin and Lotka are discussed in regard 
to the economic value of women, unemployment-oriented economists, and the con- 
cepts of economic loss to the nation versus loss to the family. 

Chapter III considers the direct costs of mental illness. Those dealing with the 
cost of care for institutionalized patients and outpatients of public institutions are 
straightforward and use previously published and generally available data particu- 
larly those from the Interstate Clearing House on Mental Health (Council of State 
Governments), the American Hospital Association, and various Federal Agencies. 

Expenditures on private physicians for mental illness are very sketchy and indicate 
the need for better data, although it is doubtful that some of the then existing and 
pertinent data resources were used by the author. For example, the author quotes and 
uses (p. 45) an estimate that “approximately 50 per cent of patients who are treated 
in general practice have psychiatric complications.” In the book, Why Patients See 
Doctors, 8. Standish, Jr., et al., University of Washington Press, Seattle, 1955, it 
was reported that 2 per cent of visits to general practitioners were for mental 
conditions. 

Chapter IV, “Some Measures and Statistics of Indirect Costs” is perhaps the weak- 
est in the book despite the authors evident interest in the theoretical aspects of the 
matter given in chapter II. In essence the number of work-years lost by institutional- 
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ized patients (by sex) are estimated and then multiplied by the annual median (not 
mean) wages for males and for females. This procedure is used in Tables 4.6, 4.8, 
4.10, 4.13. Because the author chose to use median wages instead of mean wages an 
unknown but possibly substantial error may have been introduced. The skewed dis- 
tribution of incomes in the U. S. results in the mean being substantially above the 
median. (For family units in the U. 8. the mean is about 20% above the median.) 
Since the very wealthy may also become mentally ill, there seems to be no valid 
reason for omitting their income in effect by using median figures. 

The author wished to use the median wage for each age group of hospitalized 
patients but did not. Even though such data were not available, it might have been 
valuable to make estimates (p. 62). The age distribution of mentally ill patients is sub- 
stantially different from those of wage earners, and this source of error from failing 
to make an estimate could also be large. 

Although anyone working with masses of data may tend to overlook basic rules 
regarding significant figures, the author’s inattention to these matters is frequently 
amazing. In table 4.6 (p. 65) dollar values are given to nine significant places which 
are based on a preceding table where percents with two significant places are used. 
Again on p. 69, “The total wage loss due to prolonged absence is $59,440,734. This 
is, of course, a very rough estimate, ... ” 

If data from the National Health Survey had been available to the author they 
could have been used in lieu of the tedious estimates of work absence of non-institu- 
tionalized people from mental illness based on data from the Research Council for 
Economic Security. Data from the California Health Survey published in 1957 might 
have been used by the author for these and other purposes to good advantage. 

Chapters V and VI deal primarily with an attempt to assign economic costs by 
specific diagnoses. Man power losses from various prolonged illness are presented. 
Here again, data from the National Health Survey tend to make some of these find- 
ings obsolete. 

Chapter VII, Conclusions, is written with the same skill and spirit of Chapter II. 
It is a thought provoking essay almost completely devoid of numbers. 

All in all the book’s best features are two chapters of thoughtful speculation on 
the meaning of economics for mental illness. Although the author is generally cautious 
about the validity of his procedures, the Director of the Study, Dr. Ewalt is less so 
and on p.x. of the Staff Review notes, “The essential value of his study is to provide 
sound methods of computing the costs of mental illness. On this basis, his report 
should be of lasting value to any branch of federal or state government that would 
like to know how to develop reliable and meaningful cost figures .. . ”. 

This is a tall order and the book does not fill it. Although Dr. Ewalt may believe 
that the book concentrated on developing a sound and lasting system for estimating 
cost figures, it is apparent that the author concentrated on getting something out 
of existing data without delineating the specific type of cost data which ought to be 
developed in the future. 


United Nations Demographic Yearbook 1958 (Tenth Edition). Statistical Office of the 
United Nations. New York: Columbia University Press, 1958, Pp. viii, 541. $6.50 Paper. 
$8.00 Cloth. 


Grorce W. Barctay, International Business Machines Corporation 
4 ee volume adds to the collections of population data (the tenth) published by 
the United Nations since 1945. As is the custom in this series, special emphasis is 
given to a single topic—in this case, marriage and divorce statistics (with one table 
showing annulments in a few countries). This is not the first of the Demographic 
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Yearbook series to include statistics of marriage. Here for the first time, however, 
marriage is the chief topic of the special tables. 

The issuance of the Demographic Yearbook is a vital event familiar and welcome, 
by this time, in all quarters where demographic data are used. In this issue, certain 
customary tables are brought up to date—notably the series of births, deaths, mar- 
riages, divorces, yearly population estimates and vital rates, and age-sex distributions 
of population, incorporating new data from a few recent censuses. These are, of course, 
necessary to keep the Yearbook fully current. Hands down, it is the most extensive 
collection of national (and quasi-national) data available. 

The tables of marriages this time include a recent series of annual figures by age of 
bride and groom; sex- and age-specific marriage “rates” related to the “marriageable 
population” (defined as everyone not currently married); marriages by previous 
marital status of both spouses; and marriages by age and previous marital status of 
spouses. An effort was made to provide these data for periods around the time of a 
recent census. There are also census tables showing population by age and marital 
status; and a table showing deaths by age and marital status. 

It is apparent that the practice of emphasizing a special topic in each issue (which 
has greatly enhanced the value of these volumes) has a disadvantage: the supply of 
new topics is running low. Statistics of marriage do not lend themselves to inter- 
national comparison, nor to abridgement in the manner required for a one-volume 
approach. Marriage is just not a very appropriate topic for yearbook treatment, and 
the principal shortcomings of the work are attributable to this fact: 

1. Many of the data are very poor. The Yearbook is under the handicap of being 

obliged to publish what is furnished by countries replying to its questionnaires. 
This is a severe handicap in the case of marriage statistics, and the variations 
in quality of data are not adequately reflected in the evaluation codes that are 
attached. 

. Of the countries that publish some statistics of marriages, relatively few provide 
those tables that are really needed. There is a high degree of attrition, as one 
proceeds from crude data to the useful kinds of detail. Of 173 geographic units 
providing annual numbers of marriages, only 114 showed marriages by age of 
man and wife, and just 79 included tables of first marriages by age. 

. Tables normally available are rarely adequate for analyzing the patterns of 
marriage in any rigorous fashion. This is also true of the Demographic Yearbook; 
only the last table of marriage statistics (by age and previous marital status of 
spouses) will bear any very close study. Altogether they constitute a strong 
argument for the use of census data. 

. The explanatory text (at the beginning) on “Uses of Marriage Statistics” is 
very brief. The problems of analysis cannot be covered at this length, and indeed 
only the most casual view of the uses for such data is conveyed by the text. 

What can be done with such a collection? Unfortunately, not very much. The 
tables give a taste of what is to be had in the reports of individual countries, but 
only a faint flavor of any one case; and a very limited basis of comparison between 
populations. The limitation is not a fault of the compilers, who have excelled in a 
generally arduous and thankless task. The trouble really lies in the nature of marriage 
statistics: the user must know a bit more about the subject than about most kindred 
topics of demography, and more than he can expect to find in a yearbook. 

The main service of the volume is to gather enough material to suggest the possi- 
bility of further studies. It also warns of certain limitations—where such studies run 
out of data. The material is quite ample for this purpose. It will be most useful to 
those who are prepared to go a long way beyond this stage. And this, after all, is 
the ground on which a yearbook should be judged. 
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Water Witching, U.S.A. Evan Z. Vogt and Ray Hyman. Chicago: University of Chicago 
Press, 1959. Pp. xi, 248. $4.95. 


R. Buytu, University of Illinois 


HIs is an excellent book, and one which gives the reader a real understanding of 

the phenomenon of water witching. It is written in an excellent and entertaining 
style, and is to be highly recommended for general recreational reading. It is an 
open-minded inquiry: instead of putting water witching on trial and building up a 
case for prosecution or defense, the authors invite the reader to examine the whole 
phenomenon without prejudice and determine the facts. 

The book can be outlined as follows: (a) What is water witching? A description of 
origins, history and present methods. (b) Does it work? The answer is no. A general 
discussion of methods for interpreting experimental evidence is given. The authors 
carefully explain why the available evidence does not demonstrate that witching 
helps locate water, but on the contrary demonstrates beyond any reasonable doubt 
that witching is completely ineffective. (c) What makes the rod move? An explana- 
tion of how the witcher turns the rod by unconscious muscular action in response to 
suggestion, while honestly believing that he is resisting its motion. (d) Present prac- 
tice, U.S.A. A discussion of who does water witching, where, under what conditions, 
the extent of the practice, and reasons why it continues to be done. This discussion 
is based on answers to a questionnaire sent by the authors to a sample of county 
agricultural extension agents in the U.S. 

Of most direct professional interest to statisticians is the section (b) on the inter- 
pretation of experimental evidence. The following errors are introduced in a rather 
parenthetical manner and detract only negligibly from the otherwise excellent dis- 
cussion. 

On page 64 it is asserted that the scientific approach requires a long range attack 
and cannot be used when a quick decision must be made using scanty evidence. This 
is quite false—there is a large body of scientific method (known as statistical de- 
cision theory) for this very purpose. The asserted sharp distinction between decision 
methods using good evidence and decision methods using poor evidence does not 
exist. This error is repeated on page 194 where it is falsely asserted that rational de- 
cision making is impossible when the evidence is scanty. 

On page 195:an example is given in which the concept of rational behavior in gener- 
al is confused with a special definition (maximizing expected gain) of rational be- 
havior. That this special definition is not appropriate in the example is not an argu- 
ment against rational behavior, but only against this special definition. Some other 
definition (such as maximizing median gain, or minimizing probability of a gain less 
than a specified amount) will be appropriate. The experimenter must first decide on 
what he wants his decision rule to achieve; this defines “rational behavior” for him. 


> 


PUBLICATIONS RECEIVED 


Alexander, Sidney S., Colm, Gerhard, 
eng Neil H., Shere, Louis, Roosa, 
obert V., Slichter, Sumner H., Massel, 
Mark S., and Hagen, Everett E. Econo- 
mics and the Policy Maker, Brookings 
Lectures, 1958-1959. Washington, D. C.: 
The Brookings Institution, 1959. $2.95. 
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New York, New York: Federal Reserve 
Bank, 1959. $0.50. 

Buckland, Wm. R., Editor. International 
Journal 4 Abstracts. London, England: 
Oliver & Boyd for the International Sta- 
tistical Institute, 1959. Annual Subscrip- 
tion, L5 or $16.00; single copies, 30s or 


$4.50. 
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Ferber, Robert. Collecting Financial Data 
by Consumer Panel Techniques. Uni- 
versity of Illinois: Bureau of Economic 
and Business Research, 1959. $2.50. 

Ferguson, George A. Statistical Analysis in 

sychology and Education. New York, 
New York: McGraw-Hill, 1959. $7.50. 

Garvy, George. Deposit Velocity and Its 
Significance. New York: New York: 
Federal Reserve Bank, 1959. $0.60. 
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Woodridge, Dean E. Handbook of Auto- 
mation Computation and Control, Volume 
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York, New York: Arco Publishing Co., 
Inc., 1959. $5.00. 

Gutman, Robert. Birth and Death Registra- 
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York, New York: Milbank Memorial 
Fund Quarterly, 1959. $1.00. 

Hogg, Robert V. and Craig, Allen T. Intro- 
duction to Mathematical Statistics. New 
York, New York: The Macmillan Com- 
pany, 1959. $6.75. 

Hogland, C. R., Johnson, Glenn L., 
Lassiter, Charles A., and McGilliard, 
Lon D., Editors. Feed Utilization by 
Dairy Cows. Ames ,- Iowa: Iowa State 
College Press, 1959. $3.95. 

Hall, Marshall, Jr. The Theory of Compe. 
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Methods and Applications. New York, 
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PUBLICATIONS 


Journal of the American Statistical Association—The Board announces with pleasure, 
following approval by the Council, the appointment of Clifford G. Hildreth of Michigan 
State University as Editor of the Journal. W. Allen Wallis, who served from 1950 to 
1959, resigned because of the pressure of other duties. David Wallace of the University of 
Chicago kindly filled in as Acting Editor until a special committee located a new editor. 
The Board, on behalf of the entire Association, extends its thanks and appreciation to 
Mr. Wallace for his help during this interim period. To Allen Wallis we think the best 
tribute is to point to the Journal itself. The growth in stature, as well as the increases in 
circulation, reflecting the broad reaches of statistics into so many professional fields, 
speak most eloquently of Mr. Wallis’ accomplishments as Editor. With Dean Wallis go 
our warmest thanks and gratitude for a splendid job. We welcome Mr. Hildreth and look 
forward to working with him. Finally, a special commendation to the committee members 
who worked so diligently to select a new editor: Walter E. Hoadley, Jr., Ralph J. Watkins 
and William G. Cochran, past Presidents and Almarin Phillips, Chairman of the Publica- 
tions Policy Committee. 

Technometrics—1959 has been the first full year for the new journal, Technometrics, 
published jointly with the American Society for Quality Control. This new publication, 
dealing with statistics in the physical, chemical and engineering sciences, is published 
quarterly. Four issues (February, May, August, November) have been printed, covering 
volume one. A detailed financial report on the past year’s operations is appended to the 
Report of the Secretary-Treasurer. Mr. Riley serves as Treasurer for Technometrics. A 
Management Committee, with three appointees from each society (ASA and ASQC) and 
chaired by Paul Olmstead, has the responsibility of guiding the general operations of the 
new journal. They are: (from ASA)—Irving Burr, Almarin Phillips and Donald C. Riley; 
(from ASQC)—J. Y. McClure, Maynard Renner and H. L. Wehrly. The editor is J. Stuart 
Hunter, with assistance from a board of associate editors. 

Index to the Journal, 1940-1955—The new Index was completed at the end of this year. 
This large project was done with the aid of grants from the Ford Foundation and National 
Science Foundation at the Virginia Polytechnic Institute by members of the Department 
of Statistics: Charles Clunies-Ross, William A. Glenn and Leroy Brenna. The Department 
is headed by Boyd Harshbarger. The Index covers thoroughly all authors, subjects by 
cross-classification and book reviews in volumes 35 through 50. This will be a most valuable 
reference work for all users of the Journal and the Association is indeed indebted to the 
Ford Foundation, the National Science Foundation and the Virginia Polytechnic Insti- 
tute. 

The American Statistician—Since 1952 The American Statistician has been edited by 
Almarin Phillips and then by Morris Hamburg. (Currently, Mr. Phillips is Acting Editor, 
while Mr. Hamburg is on a one year leave of absence at Harvard). Under their guidance 
the magazine, which carries articles and news of interest to the statistical world, has in- 
creased in quality as well as size. 

The various departmental editors share the accolade: Dana M. Barbour, News and 
Notes Editor, Ernest Rubin in charge of the Questions and Answers feature and Joan R. 
Rosenblatt, editor of the section on International Statistical Activities. Virginia Venneman 
has been most helpful on information concerning Federal statistical activities. 

Proceedings—The 1958 Proceedings of the Social Statistics Section was in the nature of 
an experiment, similar to that of the Proceedings of the Business and Economic Statistics 
Section four years earlier. The eight hundred copies printed have been sold. The Board 
has recommended to the Council that a 1959 edition be autigeiteen. The Proceedings of 
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both Sections contain papers given at the Annual Meeting in sessions sponsored by the 
Section. 

The Proceedings of the Business and Economic Statistics Section are just about self- 
supporting. The price is set to recover direct cost, as is the case with the Social Statistics 
Proceedings. 


ACTIVITIES 


Chapters—The thirty-ninth active chapter of the Association was chartered this year. 
The Nebraska Chapter joins the growing list of local groups holding regular meetings, 
conferences, activities, etc. Regional Conferences (one- or two-day meetings) were spon- 
sored by a number of chapters, usually in cooperation with other professional groups. De- 
tails of the conferences appear in The American Statistician. 

The President asked Howard L. Jones, a Vice-President of ASA, to head a new com- 
mittee on relationships between chapters and the national association. The committee, 
among other things, will investigate a series of suggestions from the Southern California 
.Chapter. Findings and recommendations will be reported to the membership in due course. 

Committees—The Committee on Committees, chaired by Nathan Keyfitz, has begun 
preliminary studies on the Constitution of ASA, in preparation for the revision required 
within the next few years. The current Constitution was adopted January 1, 1954 and 
must be reviewed before ten years have passed. The Committee on Committees also has 
the responsibility of reviewing charters and statements of purpose of all committees and 
sections of the Association, with recommendations to the Board. 

The final report of the Committee on Publications Policy, following its study of the 
opinions of the membership of the Association’s publications, appeared in the December, 
1959 issue of The American Statistician. We urge the members to read this interesting 
report. Comments will be most welcome. This report does not include recommendations 
to the Board—these will be presented separately by the Committee and announcements of 
actions taken will, as usual, appear in The American Statistician. 

Other Areas—Committees of the Association have been proposed in two areas of 
activity: 1) Audio-visual aids and educational TV and 2) the use of statistical procedures 
in auditing and other accounting methods. On item 2) discussions have been held with 
representatives from the accounting field, and further study in this whole area has been 
recommended. On item 1) it has been suggested that a comprehensive report on the proper 
use of various aids in the teaching of or training in statistics would be most valuable to 
those involved in educational methods. The Section on Training has been following this 
area closely. 

The Association is continuing to collaborate with related societies and special arrange- 
ments will be worked out with various groups from time to time as they appear feasible. 

Annual Meeting—The 1959 meeting in Washington was the largest ever held by the 
Association. The meeting was held jointly with a number of other societies. The press 
room, publicity and public relations generally were handled more thoroughly this year. 
A consultant on public relations, in connection with the meeting, has been retained by 
the Association. 

The 1960 Annual Meeting has been scheduled for August 23-26 on the campus of 
Stanford University, Palo Alto, California. This is a departure from past practice both 
in meeting on the west coast and at a university. Requests from a large segment of the 
membership of the Association have led to this meeting on a campus. Other future meeting 
dates now scheduled are: 


New York City December 27-30 
Minneapolis Early September 
Cleveland Early September 
Chicago December 27-30 
Philadelphia Early September 


Travel Grants—1960 will be the last year under the three-year arrangement of travel 
grant funds from the Carnegie Corporation. The Board has recommended a request for 
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1964 
| 1965 
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renewal of the program, in concert with other societies who also received funds for this 
purpose. A number of international meetings have been announced by the Committee on 
Travel Grants as being eligible for inclusion on applications. Announcement of recipients 
of grants for travel to these meetings will be made in The American Statistician. 


REPORT OF THE SECRETARY-TREASURER, 1959 


The 1959 addition to surplus comes to about $8,500—a little above average for the 
Association over the last ten years. However, while this excess of income over expense is 
not unusual despite the fact that we have had some lean years, it is notable in the light of 
constantly rising costs without any increase in the dues or subscription rates for the past 
twelve years. For several years the Association’s budgetary requirements have been esti- 
mated to a degree of closeness between income and expense, where a not unusual increase 
in any one of a number of expense items would have meant a loss for the year. Fortunately 
this has happened only once since 1950. However, the once-expressed aim of securing a 
surplus equal to one year’s income seems destined not to be realized since the increases in 
budget have been consistently higher than the average yearly addition to the surplus, 
which is now just under $58,000. 

Ten years ago the Association had approximately 4,300 members (at the end of 1949). 
At the end of 1959 the membership roll stands at 7,000 (see details below). In those ten 
years the publishing activities have expanded greatly. Aside from the regular periodicals 
(the Journal and The American Statistician) there has been a number of special publica- 
tions. 

The increases in income from more members and the same factor in subscriptions from 
the Journal have played a large part in supporting the expansion in Association activities. 
But other items have helped: the increase in advertising income both from more ads and 
from higher rates, after a study extending over several years; small increases in interest 
income as more cumulative surplus became available for placement in savings and loan 
associations, U. S. Government bonds, and other investments; and miscellaneous other 
items reflecting the growth of the membership or increase in charges. The foregoing, 
coupled with strict attempts to economize on expense without hampering the publications 
or services provided, has resulted in the small yearly additions to surplus. We must realize, 
nevertheless, that constant inflation may upset this balance, and be prepared to act ac- 
cordingly. 

The amount added to the cumulative reserve at the end of 1959 is $8,500. Total income 
for the full year was $95,000, while exrenses took just about $86,500. The income figure is 
somewhat higher than the amount budgeted: $88,000. $87,000 was budgeted for expense 
for 1959. The detailed audit by the accounting firm of Alexander Grant and Company 
which follows this report gives the breakdown on all items for comparison. 

The 1960 proposed budget shows the expected continued rise in both income and ex- 
pense for the coming year. The budgeted amounts are $98,850 for income and $98,100 
for expense, being an approximately balanced budget, if there is no unforeseen rise in ex- 
penses. The 1960 budget allows a modest increase for the printing of the Journal and The 
American Statistician. Small increases over 1959 are planned for several expense items: 
salaries, supplies and office expense, telephone, etc. One item appearing this year for the 
first time is indicative of the improvement in the financial situation: officers’ travel. For 
some time the Association has discussed the possibility of reimbursing Board Members for 
travel, as is the case in many other associations. $2,500 is therefore included in the budget 
for reimbursement of Board members only, and this only for those who do not have a 
ready alternate source for travel expense. Travel for Board members when the meeting 
occurs during the Annual Convention is not included. Whether this system could be main- 
tained each year will depend on the existence of a stable Association income. 

It is gratifying to report that membership passed the seven thousand mark in 1959. 
As in 1958, the number of new members was much higher than before: over eleven hundred 
persons joined in the past year. Our breakdown shows the following: at the beginning of 
1959 there were 6,325 regular, student and family members (including 107 Institutional 
representatives, considered as regular members, from 39 companies and organizations). 
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1,140 new members joined and 23 past members were reinstated. 148 names were removed 
from the list through resignations and deaths and }# were dropped for non-payment of 
dues. The resulting total at the beginning of 1960 is 7,026 members. 

The Finance Committee recommended this year that the Association’s investments be 
divided into three categories with approximately one-third of the funds placed in each: 
1) insured savings and loan assvciations; 2) U. S. Treasury bonds; 3) Mutual investment 
trusts. In keeping with this study, shares were purchased in three trusts, as follows: 
200 shares of Tri-Continental Corporation; 300 shares of Niagara Corporation; and 300 
shares of Consolidated Investment Trust. The aim of these purchases is to provide the 
Association with “growth” stock, though it is expected eventually that they will pay as 
much as or slightly more than the four to four and one-half per cent paid annually by 
savings and loan institutions. 

In general it has been a good year and many new activities have been undertaken. 
Many more things could be done with more funds and manpower. We stretch ourselves 
very thinly at times. At a time when the field of statistics is being increasingly recognized 
perhaps the Association’s biggest shortcoming is its unorganized placement service. This 
takes time and effective manpower. I hope to be able to suggest some steps in this area 
soon. 

Donatp C. RiLEy 
Secretary-Treasurer 


AUDITORS’ REPORT 
AMERICAN STATISTICAL ASSOCIATION 
DercEMBER 31, 1959 


ALEXANDER GRANT & COMPANY 
CrertTIFIED Pusiic ACCOUNTANTS 
1025 Connecticut AVENUE 
Wasuineron 6, D. C. 
Board of Directors 
American Statistical Association 
Washington, D. C. 

We have examined the balance sheet of the AMERICAN STATISTICAL ASSOCIATION (a non- 
profit organization) as of December 31, 1959, and the related statement of income and 
association equity for the year then ended. Our examination was made in accordance with 
generally accepted auditing standards and accordingly included such tests of the account- 
ing records and such other auditing procedures as we considered necessary in the circum- 
stances. 

In our opinion, the accompanying balance sheet and statement of income and associa- 
tion equity present fairly the financial position of the American Statistical Association at 
December 31, 1959, and the results of its operations for the year then ended, in conformity 
with generally accepted accounting principles applied on a basis consistent with that of the 
preceding year. 

Comments in regard to the scope of our examination and details of certain items shown 
in the balance sheet and statement of income and association equity are presented in the 
following paragraphs. 

BALANCE SHEET CoMMENTS 
Cash 

Balances comprising the total cash of $29,628 at December 31, 1959, are as follows: 
American Security & Trust Company 

Operating account 

Special account 
Petty cash 
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Cash in banks and funds on deposit with savings and loan institutions were confirmed 
directly with the depositories. At December 31, 1959, cash in the amount of $20,929 was 
restricted for the unexpended balances of grants (see grant account analysis) and by 
the amount received from the Business and Economic Section of A.S.A. to be applied 
against the cost of future regional meetings. 


Investments 


Income from deposits with savings banks and savings and loan institutions and from 
investments in government bonds and other securities totaled $2,976. The average interest 
rate on funds invested in savings banks and savings and loan institutions for the year 
ended December 31, 1959, was approximately 4.1%. 


Accounts receivable 


The following items make up the accounts receivable total of $10,277 at December 31, 
1959: 
Advance to Technometrics 
Amounts due for advertising in December 1959 Journal 
Amounts due for advertising and exhibits at 1959 annual meeting 
Estimated amount due from Arrangements Committee for 1959 annual meeting 
Refund due from U. 8. Government, representing A.S.A. portion of taxes paid 
by mutual funds on undistributed gains 
Dividend on investments held by broker 


$10,277 


Inventories 


In accordance with past practice, the cost of 100 copies of each of the four issues of the 
Journal during 1959 was capitalized and added to the value of the inventory of past issues. 


Accounts payable—trade 


At December 31, 1959, accounts payable—trade of $20,898 included the following: 
George Banta Company (printing and mailing costs related to December 1959 

Journal) $ 4,164 
The McArdle Printing Company, Inc. (printing and mailing costs related to 

the October and December 1959 American Statistician and other printing and 

mailing charges) 
Virginia Agricultural Experimental Station and the Educational Foundation 

of Virginia Polytechnic Institute (applied against Ford Foundation grant). . 


Deferred income 


Total deferred income at December 31, 1959, of $35,488 was $1,253 less than the cor- 
responding amount at December 31, 1958. Advance payments of membership dues de- 
creased by $3,436, while subscriptions received in advance and other miscellaneous items 
had a net increase of $2,183. The reduction in advance membership dues was caused pri- 
marily by a later mailing of 1960 membership bills as compared with the prior year. 


Unexpended grants 


The Association has been the recipient of three grants. The Ford Foundation Grant was 
initiated in 1956 for the purpose of indexing the Journal of the American Statistical Asso- 
ciation for volumes 35 to 50. 

The second grant was received in 1959 from the National Science Foundation for the 
purpose of publishing the index to the Journal of the American Statistical Association. 
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The income from sales of the Journal Index for the first three years will be returned 
to the National Science Foundation. Sales of this publication amounted to $1,364 at De- 
cember 31, 1959. 

The third grant, which is called the Travel Fund Grant, was received in 1957 from the 
Carnegie Corporation to cover expenses incurred on behalf of sending delegates to inter- 
national meetings. 

An analysis of these three grants for 1959 follows: 

National 
Ford _ Science 
Foun- Foun-_ Travel 
dation dation Fund 
Grant Grant Grant 
Unexpended balance—January 1, 1959 $ — $7 ,359 
Received in 1959 13 ,000 =~ 
200 40 


$12,800 $7,319 


The Ford Foundation Grant expenses represent amounts due to the Virginia Polytechnic 
Institute at December 31, 1959, for work performed on the index project. 


INcoME AND AssocrIATION Equity COMMENTS 
Income 


Total income for the year 1959 was $10,705 higher than that for the preceding year. The 
major contributing factors to this increase were a larger number of paid members and an 
increase in advertising and subscriptions to the Journal and to the American Statistician. 


Expenses 


The increase of $5,929 in 1959 expenses as compared with 1958 was due mainly to in- 
creases in office salaries of $4,191 and publications’ expenses of $1,318. 
ALEXANDER Grant & CoMPANY 
Washington, D. C. 
March 7, 1960 


AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE BALANCE SHEET 
DEcEMBER 31, 1959 AND 1958 


December 31, Increase 
1959 1958 (decrease) 


$ 29,628 $ 23,075 $ 6,553 
Investments (supplemental schedule below) 
Funds on deposit in savings accounts and savings 
and loan institutions 54,851 (3,223) 
17 ,667 
Investments in stocks 
Accrued interest receivable 525 
Accounts receivable 5,262 
Inventories 
3,406 
2,689 
54 
3,332 (1,021) 


$138,814 $110,861 $27,953 


Unexpended balance— December 31, 1959................ $3 — [_ 
Assets 
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Fixed assets—at cost 
Furniture and fixtures 8,026 7,993 33 
Office machines 500 —_ 500 


8,526 7,993 533 
Less accumulated depreciation 6,448 6,088 360 


2,078 $ 1,905 $ 173 


$140,892 $112,766 $28,126 


The accompanying report letter is an integral part of this balance sheet. 


AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE BALANCE SHEET 
DeceMBER 31, 1959 AND 1958 


December 31, Increase 
1959 1958 (decrease) 
Liabilities 
Accounts payable—trade 
Chapter dues 
1,050 
474 
Philadelphia 
Due to National Science Foundation (sales of the 
Index to the Journal) — 
Subscriptions received for T'echnometrics 
Due to American Sociological Society 120 
280 
4 


$ 26,533 $13,116 $13,417 
Deferred income 
Membership dues 21,691 25,127 (3,436) 
Subscriptions 

Journal 9,579 7,397 2,182 
403 301 102 
3,348 3,449 (101) 

467 467 — 


$ 35,488 $ 36,741 $(1,253) 
Amount reserved for future regional meetings....... 811 — 811 


811 $ 8sil 

Unexpended grants 
Ford Foundation — 6,191 (6,191) 
National Science Foundation 12,800 _— 12,800 
Travel Fund Grant 7,319 7,359 (40) 


$ 20,119 $ 13,550 $ 6,569 
Association equity 57,941 49 ,359 8,582 


$140,892 $112,766 $28,126 


The accompanying report letter is an integral part of this balance sheet. 
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AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE STATEMENT OF INCOME AND ASSOCIATION Equity 
Years ENDED DEcEMBER 31, 1959 AND 1958 


Year ended 
December 31, Increase 
1959 1958 (decrease) 
Income 
Dues —old members 
—new members 
Subscriptions—-Jcurnal 
—American Statistician 
Advertising—Journal 
—American Statistician 


—1958 Membership Directory 
— Business and Economic Section Proceedings. . 
—Journal Reprints 
—Social Statistics Proceedings 
—Other sales 
Mail list income 
Interest income 
Gain on investments in stocks 
Miscellaneous 


Total income 


Expenses 


Depreciation 

Supplies and office expense 
Postage and delivery 
Telephone and telegraph 
Accounting services 
Clerical services 

Repairs and maintenance 
Insurance 


Promotion activity 
Committee expense 

Travel and secretary’s expense 
Annual meeting (net) 


49,359 45,553 


$57,941 $49,359 $ 8,582 


The accompanying report letter is an integral part of this statement. 
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—American Statistician................000008 113 149 (36) 
485 405 80 
4,642 4,431 211 
1,112 35 1,077 
1,639 — 1,639 
174 301 (127) 
2,441 2,194 247 
2,789 3,066 (277) 
187 _ 187 
151 _ 151 
4,242 3,759 483 
2,339 2,979 (640) 
993 1,150 (157) 
Publications (supplemental schedule below)........ 44,010 42,692 1,318 
Total $86,491 $80,562 $ 5,929 
Association Equity—Beginning of Year............... EZ 3,806 
Association Equity—End of Year.................... EE 
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Fixed assets—at cost 
Furniture and fixtures 8,026 7,993 
Office machines 500 —- 


8,526 $ 7,993 
6,448 6,088 


$ 2,078 $ 1,905 $ 173 


$140,892 $112,766 $28,126 


The accompanying report letter is an integral part of this balance sheet. 


AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE BALANCE SHEET 
DeEcEMBER 31, 1959 anv 1958 


December 31, Increase 
1959 1958 | (decrease) 
Liabilities 
Accounts payable—trade $ 20,898 $ 11,188 $ 9,710 
Chapter dues 


Washington, D. C 
Philadelphia 
Due to National Science Foundation (sales of the 
Index to the Journal) 
Subscriptions received for Technometrics 
Due to American Sociological Society 
280 
4 


$ 26,533 $13,116 $13,417 
Deferred income 
Membership dues 21,691 25,127 (3,436) 
Subscriptions 

9,579 7,397 2,182 
403 301 102 
3,348 3,449 (101) 

467 467 _ 


$ 35,488 $ 36,741 $(1,253) 
Amount reserved for future regional meetings 811 — 811 


811 $ 8il 

Unexpended grants 
Ford Foundation 6,191 (6,191) 
National Science Foundation 12,800 _ 12,800 
Travel Fund Grant 7,319 7,359 (40) 


$ 20,119 $ 13,550 $ 6,569 
Association equity 57,941 49 ,359 8,582 


$140,892 $112,766 $28,126 


The accompanying report letter is an integral part of this balance sheet. 
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AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE STATEMENT OF INCOME AND AssocIATION Equity 
Years Enpep DreceMBER 31, 1959 AND 1958 


Year ended 
December 31, Increase 
1959 1958 (decrease) 
Income 


Dues —old members $48,795 $44,358 
—new members 6,984 
Subscriptions—J ournal 14,681 
—American Statistician 
Advertising—Journal 
—American Statistician 
Sales—Journal 
—American Statistician 
—1958 Membership Directory 
—Business and Economic Section Proceedings. . 
—Journal Reprints 
—Social Statistics Proceedings 
—Other sales 
Mail list income 
Interest income 
Gain on investments in stocks 
Miscellaneous 


Total income $10,705 


Expenses 


4,191 
35 


Depreciation (17) 
Supplies and office expense 483 
Postage and delivery (640) 
Telephone and telegraph 

Accounting services 

Clerical services 

Repaire and maintenance. .. 

Insurance 


Promotion activity 
Committee expense 

Travel and secretary’s expense 
Annual meeting (net) 


Total expenses $80,562 $ 5,929 


Net Income 3,806 4,776 
Association Equity—Beginning of Year 49,359 45,553 3,806 


Association Equity—End of Year $57,941 $49,359 $ 8,582 


The accompanying report letter is an integral part of this statement. 
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$ 4,437 
(197) 
- 1,227 
49 
1,451 
464 
122 
(36) 
80 
211 
1,077 
1,639 
(127) 
247 
(277) 
187 
151 
716 
Publications (supplemental schedule below)........ 44,010 42,692 1,318 
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AMERICAN STATISTICAL ASSOCIATION 
CoMPARATIVE SCHEDULE or AcTUAL INCOME AND EXPENSES WITH BuDGET 
Year ENpED DEcEMBER 31, 1959 


Year ended 
December 31, 1959 
Actual Budget 
Income 

Dues—old members 

—new members 
Subscriptions—Journal 

—American Statistician 
Advertising—Journal 
—American Statistician 

Sales—Journal 

—American Statistician 

—Social Statistics Section Proceedings......... 

—Business and Economic Statistics Section Pro- 

ceedings 

—Other (including reprint sales) 
Mail list income 
Interest income 
Annual meeting—net (loss) 
Miscellaneous 


$ 5,966 
Expenses 
Salaries 1,241 
Pension plan (191) 
Publications—Journal printing and reprints (2,740) 
—Journal editorial 3 
1,431 
—Directory-net (1,268) 
—Social Statistics Section Proceedings. . 400 
—Business and Economic Statistics Sec- 
tion Proceedings 
Promotional activity 


Travel and secretary 
President’s travel 
Supplies and office expense 
Postage and shipping charges 
Telephone and telegraph 
Accounting services 
Committee expenses 
Miscellaneous—depreciation 
—dues and subscriptions 
~—taxes 


$(1,616) 


$ 7,682 


410 
Actual 
over 
(under) 
budget 
$ 1,295 
(213) 
4 408 
76 
1,227 
811 
139 
2 1,142 
887 
441 
(1,572) 
287 
— 
ines 528 (672) 
742 
—repairs and maintenance........... 102 (98) 
291 41 
$85,434 $87,050 
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INVESTMENTS 
DrEcEMBER 31, 1959 


Cost or Income 
amount for year 
invested ended Effec- 
at Quoted Decem- _ tive 
December market ber 31, rate of 
31, 1959 value 1959 interest 


Savings Accounts 


City Federal Savings and Loan Association.... $10,000 
Fidelity Federal Savings and Loan Association. © 10,274 
Guardian Savings and Loan Association 

Hyattsville Building Association 

Liberty Savings and Loan Association 

Marshall Savings and Loan Association 

Piedmont Federal Savings and Loan Association 
Trans-Bay Federal Savings and Loan Associa- 


Jefferson Federal Savings and Loan Association 


U. S. Government Bonds 


U. S. 24 savings bonds, series G, due 1962 $3,035 $ 77 
U.S. Treasury 24% bonds, $10,000 face amount, 

due November, 1961 9,556 
Federal Land Bank 4% bonds, due May 1962. . 4,844 


$17,435 


Investments in Stocks 


Tri-Continental Corporation $ 7,875 
Niagara Share Corporation 6,488 
Consolidated Investment Trust 5,850 


$20,442 $20,213 $ 187 


In addition to the investment income listed in this schedule, $153 was received from de- 
posits in a savings account which was closed during the year. 


$ 400 
407 
200 
64 
160 
250 
399 
SC 840 29 
$51,628 $2,109 4.1% 
2.5% 
2.6 
4.0 
$17,667 #527 3.0% 
$ 4 
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AUDITORS’ REPORT 
TECHNOMETRICS 
DeEcEMBER 31, 1959 


ALEXANDER GRANT & COMPANY 
CERTIFIED PuBLIc ACCOUNTANTS 
1025 Connecticut AVENUE 
6, D. C. 
Management Committee 
Technometrics 
Washington, D. C. 


We have examined the balance sheet of Technometrics (a non-profit organization) as of 
December 31, 1959, and the related statement of income and equity for the year then 
ended. Our examination was made in accordance with generally accepted auditing stand- 
ards and accordingly included such tests of the accounting records and such other auditing 
procedures as we considered necessary in the circumstances. 

In our opinion, the accompanying balance sheet and statement of income and equity 
present fairly the financial position of Technometrics at December 31, 1959, and the results 
of its operations for the year then ended in conformity with generally accepted accounting 
principles consistently applied. 

Comments in regard to the scope of our examination and details of certain items shown 
in the balance sheet and statement of income and equity are presented in the following 
paragraphs. 

BALANCE SHEET COMMENTS 
Cash 

Cash at December 31, 1959, in the amount of $8,284 was confirmed directly with the 

depository, American Security and Trust Company. 


Savings account 


The savings account balance of $7,070 at December 31, 1959, was confirmed with the 
depository, the Riggs National Bank. 


Accounts receivable 

The following items make up the accounts receivable total of $4,567 at December 31, 
1959: 
Unremitted subscriptions 

American Statistical Association 

American Society for Quality Control $3 ,062 


Advance to Princeton University 
Amounts due from sales of reprints 
Advertising in Technometrics for November 1959 


Accounts payable 

At December 31, 1959, accounts payable aggregating $4,256 included the following 
items: 
William Byrd Press—(cost of printing the November 1959 issue of Technometrics) $3,092 
University of Wisconsin 
American Statistical Association 
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Deferred subscriptions 


Deferred income totaling $4,860 at December 31, 1959, represents subscriptions re- 
ceived applicable to future issues of the publication, Technometrics. 


CoMMENTS ON INCOME AND Equity 
Income 


Of the total income of $15,848 for the year ended December 31, 1959, subscriptions to 
Technometrics amounted to $13,800. The Chemical Division of the American Society for 
Quality Control and the Business and Economics Section of the American Statistical 
Association made donations of $750 each in 1959. 


Expenses 


Total expenses for the year ended December 31, 1959, amounted to $14,972. This was 
made up principally of the cost of publishing Technometrics. 

During the year 1959, the sponsoring organizations made only a nominal charge for 
various office and administrative costs which were incurred by them incident to process- 
ing subscriptions and preparing mailing list addressograph plates. It is understood that 
the American Statistical Association, which has assumed the primary functions of process- 
ing subscriptions, preparing addressograph plates and providing general administration 
of the magazine, will make a higher charge for such services beginning in 1960. 


ALEXANDER GRANT & CoMPANY 
Washington, D. C. 


March 7, 1960 
TECHNOMETRICS 
BALANCE SHEET 
DeEcEMBER 31, 1959 


Assets 


Savings account 
Accounts receivable 


Accounts payable 

Deferred subscriptions 

Advances from sponsoring organizations 
American Statistical Association 
American Society for Quality Control 


$19,992 


The accompanying report letter is an integral part of this balance sheet. 


$19,992 

Liabilities 
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TECUNOMETRICS 
STATEMENT OF INCOME AND Equity 


YEAR EnvEp DeEcEMBER 31, 1959 
Income 


Subscriptions 
Members of A.S.A. and A.S.Q.C 


Sales of reprints 
Less cost of reprints 255 


Advertising 150 
Interest 141 
Donations from sponsoring organizations 1,500 
Miscellaneous 2 


Total income $15,848 


Expenses 


Publication expenses 
Editorial 


13 ,049 


Promotion 1,603 
General and administrative 

Telephone and telegraph 

Supplies 320 


Net Income 
Equity—Beginning of Year.... 


Equity—End of Year 


The accompanying report letter is an integral part of this statement. 


Maintenance of subscription 511 

876 


The 1959 Proceedings 
of the 

Business & Economic 
Statistics Section 

of the 

American Statistical 
Association 


SESSIONS INCLUDE: 


The Securities Market 


Aspects of Stock Ownership in United States 
Corporations 


The Economic Outlook 


Business Fluctuations and Purchases of Capital 
Goods and Materials 


Appraisal of Current Anticipation Data 
Research on Productivity, Wages and Prices 


Controversial Areas in Federal Statistics: Antici- 
patory Data 


Controversial Areas in Federal Statistics: The 
Care and Maintenance of Index Numbers 


Statistical Needs for Construction and Mort- 
gage Finance 


Statistics for a Federal Transportation Policy 

International Measurement and Analysis of Sav- 
ings 

Some Special Studies Based on Recent Censuses 

Regional and National Accounts 


Problems of the Standard Metropolitan Statis- 
tical Area Definition 


Role of the Statistician in Policy Formation 


Price: to members, $3.25; to non-mem- 
bers, $3.75, if remittance is sent with 
order. An additional charge of $.25 
will be made for postage and handling 
on orders received without remittance. 


Order your copy from American Sta- 
tistical Association, 1757 K St., N.W., 
Washington 6, D.C. 


wile BESELER 


VU-GRAPH* 


OVERHEAD TRANSPARENCY 
PROJECTOR! 


PROJECTS A HUGE BRILLIANT 
IMAGE BEHIND YOU AS YOU 
FACE YOUR CLASS 


TEACH IN A FULLY LIGHTED ROOM — to watch 
Class reactions, permit note-taking 


PROJECT WHAT YOU WRITE, AS YOU WRITE IT 
ra) personalize lessons, emphasize specific 
points 

PROJECT ONE TRANSPARENCY OVER ANOTHER 
—to build a complete lesson, step-by-step, 
right before your students’ eyes 


TEACH DRAMATICALLY IN ANY SUBJECT — with 
— as unlimited as your own imag- 
ination. ... 


In statistical research, for example, can build a 
complete graph, step by step in 3 or i overlaid trans- 
parencies which show related trends such as in- 
——., medical aid, lengthened life span, and over- 
ion. 


WRITE FOR COMPLETE INFORMATION 
; Charles Beseler Company 
H 214 So. 18th Street, East Orange, New Jersey 
4 Send me complete information on the Beseler 
! VU-GRAPH OVER 

HEAD TRANSPARENCY 


O VU-LYTE Il OPAQUE PROJECTOR 


Have a Beseler A-V 
Consultant contact me for a 
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Address 


Since few companies have 
enough work volume to jus- 
tify computers of their own, 
STATISTICAL maintains com- 
puting equipment to serve any 
company on a low-cost, hourly, 
as-needed basis. 

This service is built around 
the combined skills of mathe- 
maticians, statisticians, project 
engineers and programming 
specialists—ready to work for 
you on your computing prob- 
lem. 


Here are a few of the appli- 
cations in which our computer 


Established 1933 


COMPUTING 
SERVICE 


..» Made to Order 
For Researchers 
and Statisticians 


STATISTICA 


TABULATING CORPORATION 


TABULATING + CALCULATING * TYPING 
TEMPORARY OFFICE PERSONNEL 


Please mention the Journal of the American Sratisticat Association in writing advertisers 


will excel in saving you time 
and money: 


e Simple and Multiple Cor- 
relations and Regressions 


e Analysis of Variance 
e Factor Analysis 


e Chi Square For A 
Contingency Table 


Matrix Calculations 
e Linear Programming 
e Curve and Surface Fitting 


e Solution of 
Differential Equations 


Just contact our nearest office 
today for a free analysis and 
cost.estimate of your problem. 


GENERAL OFFICES: 


53 West Jackson Boulevard 
Chicago 4, Illinois 
Phone: HArrison 7-4500 


Chicago @ New York @ St. Louis 
Newark © Cleveland 

Los Angeles @ Kansas City 

San Francisco @ Milwaukee 
Philadelphia @ Palo Alto 

Yan Nuys 
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Opportunities in Computer Programming 


COMMENT suseourme 10 Get FREE UST 


GEL. 
(FREE -)- “1; 
FREE L Join c (Loc), 18), 


Creating new systems of automatic programming 


One of the most challenging facets of pro- 
gramming is the creation and implementa- 
tion of automatic programming systems. 
Substantial progress has been made over the 
past three years. As in any pioneering effort, 
however, new problems constantly arise. Here 
are a few of the frontier areas of program- 
ming you might work in at IBM: 

New logical scanning techniques for pro- 
gramming languages .. . 

New, more powerful languages for the state- 
ment of problems... 

Processors to translate programming lan- 
guages into efficient machine language pro- 
grams 


Advanced languages powerful enough to 
describe their own processors .. . 


Programming concepts affecting logical ma- 
chine design... 
The general solution of these problems will 
lead to techniques applicable to many fam- 
ilies of computers. If you are a qualified 
programmer or mathematician and are inter- 
ested in joining professional people working 
in a professional atmosphere, with ample 
computer time to test your ideas, write to: 

Manager of Technical Employment 

IBM Corporation, Dept. 577R 

590 Madison Avenue 

New York 22, New York 


INTERNATIONAL BUSINESS MACHINES CORPORATION IBM 
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Exclusive 
Monro-Matic” 


AUTOMATIC 
SQUARING 


works it far 
_ faster for you! 


j 


Enter an x value into the Monro-Matic 
Statistical Calculator keyboard only once; 
the machine takes over the squaring; 
automatically, your task is nearly halved. 


This is one of many advanced features 
especially pleasing to people who live with 
figures. Why not ask your Man from 
Monroe for a free demonstration of the 
Monro-Matic Statistical Calculator today? 


ADDING 
[B a pivision oF uTTon inpusTREs DATA PROCESSING MACHINES 


Monroe Calculating Machine Company, Inc. 
Sales and service in principal cities everywhere. General offices, Orange, N. J. 
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Careers in 


Mathematics 


Vitro’s increased activities in the field of Operations 
Research have created career opportunities for men 
with these interests and qualifications: 


MATHEMATICAL STATISTICIANS 


MS or PhD for conducting and consulting on analytical 
studies in a wide variety of applications, including informa- 
tion theory, weapons systems analysis, experimental design, 
data treatment. Should be familiar with some of the follow- 
ing—Monte Carlo procedures, Markov processes, decision 
theory, operations research, and have had 3-5 years industrial 
experience in implementing these techniques. Position is in 
the Information Analysis Group. 


OPERATIONS RESEARCH ANALYSTS 


MA or PhD in Mathematics, Statistics or Physics. Conduct 
and direct operations research studies, principally in the 
areas of weapons systems evaluation, ballistic missile de- 
fense, anti-submarine warfare and electronic countermeas- 
ures. Should have experience in some of the following areas: 
applications of game theory, linear programming, Monte 
Carlo techniques, queueing theory and model construction. 


> Our modern laboratory is located in a suburban area with 
easy access to the cultural and educational facilities of metro- 
politan New York and New Jersey. Liberal benefits include 
a tuition refund plan and relocation allowances. 


Send resume and salary requirements to Mr. S. 
Roberts. 


Div. of Vitro Corp. of America 
200 Pleasant Valley Way West Orange, New Jersey 
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Announcing 


MODERN 
ELEMENTARY 
STATISTICS, 2nd. Ed. (1960) 


by Joun E. Freunp, Arizona State University 


An increased emphasis on statistical inference characterizes the 
new edition of this successful introduction to statistics. New 


material includes: 


@ chapters on tests of hypotheses and nonparametric 


tests 


@ brief discussion of game theory and decision-making 
@ some change in symbolism and the definition of 


standard deviation 


@ annotated bibliographies at the ends of all chapters 


The book continues to provide all the basic tools for statistical 
use in the natural and social sciences without requiring an ex- 
tensive mathematical background. 


TABLE OF CONTENTS 


PART I: 

Introduction 

Frequency Distributions 

Measures of Location 

Measures of Variation 

Further Descriptions: Sym- 
metry, Skewness, Peakedness 

Further Descriptions: Index 
Numbers 


PART II: 

Probability, Expectation, and 
Decision-Making 

Theoretical Distributions 

Sampling and Sampling Dis- 
tributions 

Problems of Estimation 

Tests of Hypotheses 

Nonparametric Tests 


May 1960 


App. 416 pp. 


PART III: 
Linear Regression 
The Co-efficient of Correlation 
Further Problems of Correla- 
tion 
Time Series Analysis 
APPENDIX I: 
Rules of Summations 


APPENDIX II: 
Calculations with Rounded 
Numbers 


APPENDIX III: 


The Use of Square Root 
Tables 

Tables 

Answers 

Index 


Trade price: $9.35* 


* Book also available in textbook edition for quantity sales to schools 


and colleges. 


Te “sceive approval copies, write: Dept. ASA, BOX 903 


PRENTICE-HALL, Inc. 
Englewood Cliffs, New Jersey 
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A Chock List of 
NEW AND RECENT IRWIN TEXTS IN STATISTICS 


STATISTICS: With Applications in Management and Economics 
By Earl K. Bowen, Babson Institute 


BASIC STATISTICS FOR BUSINESS ECONOMICS 


By C. Frank Smith, State University of lowa, and D. A. Leabo, University of 
Michigan 


STATISTICS FOR BUSINESS DECISIONS 


By Ernest Kurnow, Gerald J. Glasser, and Frederick Ottman, all of New York 
University 


QUALITY CONTROL AND INDUSTRIAL STATISTICS 


Revised Edition 
By Acheson J. Duncan, The Johns Hopkins University 


WORKBOOK IN BUSINESS STATISTICS 


Fourth Edition 
By Louis F. Hampel, United Air Lines 


RICHARD D. IRWIN, INC. 


HOMEWOOD, ILLINOIS 


A major statistical analysis 
of Massachusetts Shipping. 
1697-1714 


By BERNARD BAILYN AND LOTTE BAILYN. This unique volume 
serves two purposes in its exhaustive statistical analysis of the Massachusetts 
Register for the years 1697-1714. The first, achieved through its 33 statistical 
tables, is to create a new body of historical data which will provide a scale 
of measurement and a realistic sense of magnitude of the development of 
mercantile life in Colonial America. The statistical compilations were made 
by IBM processes, and the second purpose of the study is to investigate the 
possibilities and problems of applying machine techniques to historical data 


of this sort. Introduction, tables, and special procedural appendix. $5.00 


At all booksellers 


UNIVERSITY PRESS 
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Mathematical Methods in the 
Social Sciences, 1959 


PROCEEDINGS OF THE FIRST STANFORD SYMPOSIUM. Edited by 
Kenneth J. Arrow, Samuel Karlin, and Patrick Suppes. 
Papers presented at Stanford in June, 1959, in econometrics, opera- 
tions research, and mathematical learning theory. Stanford Mathe- 
matical Studies in the Social Sciences, lV. $8.50 


Contributions to 
Probability and Statistics 


ESSAYS IN HONOR OF HAROLD HOTELLING. Edited by Ingram Ol- 
kin, Sudhish G. Ghurye, Wassily Hoeffding, William G. 
Madow, Henry B. Mann. Forty papers in the fields of statistics, 
probability theory, economics, and biology. Stanford Studies in 
Mathematics and Statistics, /1. $6.50 


Order from your bookstore, please 
STANFORD UNIVERSITY PRESS 


Mathematical Statisticians 


Exceptional opportunities exist at the Naval 
Weapons Laboratory for mathematical statis- 
ticians with Bachelor’s, Master’s and Doctor’s 
degrees and an interest in operations re- 
search. The principal efforts of the Opera- 
tions Research Group at present are devoted 
to the formulation and execution of extensive 
programs in the areas of target analysis, 
weapons system analysis, and missile feasi- 
bility and evaluation. Senior Statisticians on 
the staff also serve as consultants in areas of 


statistical inference, probability, and experi- 
mental design. The most advanced computing 
equipment and capable junior scientists are 
available for assistance. Starting salaries 
range from $5,880 to $11,595 per annum. The 
Naval Weapons Laboratory provides an ex- 
cellent work atmosphere and, in addition, the 
advantages of living in a pleasant small com- 
munity with economical housing and many 
recreational facilities. 


For further information, write to the Director, 
Computation and Analysis Laboratory. 


NWL 


U. S. Naval Weapons Laboratory 
Department of the Navy ® Dahlgren, Virginia 
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BIOMETRICS 


Journal of the Biometric Society 
Vol. 16, No. 2 CONTENTS June 1960 
The Role of Statistics in Physiological Research C. W. Emmens 


Rank Analysis of Incomplete Block Designs: Method of Paired Comparisons Em- 
ploying Unequal Repetitions on Pairs Otto Dykstra 


The Treatment of Reciprocal Interaction, with or without Lag, in Path Analysis 
Sewall Wright 


Estimating the Parameter in a Conditional Poisson Distribution A. C. Cohen, Jr. 


Models for the Interpretation of Experiments Using Tracer Compounds 
Jerome Cornfield, Jesse Steinfeld, and Samuel W. Greenhouse 


Genetic Correlations With Multiple Alleles R. G. Stanton 
Analysis of Experiments Measuring Threshold Taste E. K. Harris 


An Unbiased Sampling and Estimation Procedure for Creel Censuses of Fishermen 
D. S. Robson 


The Mathematical Theory of Biological Assay of a Local Anaesthetic N. K. Chakravartt 


Queries and Notes 


Calculation of Inbreeding in Family Selection Studies on the IBM 650 Data Proc- 
essing Machine K. Hoen and A. H. E. Grandage 


A Note on Optimum Family Size in Selection Programs Allan Robertson 
On Heuristic Estimation Methods Malcolm E. Turner 


A Genetic Application of the Schumann-Bradley Table Stanley Wearden 


Biometrics is published quarterly. Its objects are to describe and exemplify the use of 
mathematical and statistical methods in biological and related sciences, in a form assimi- 
lable by experimenters. Members of the American Statistical Association may subscribe 
through the Association at the rate of $4 yearly. The annual non-member subscription 
rate is $7. Inquiries, orders for back issues and non-member subscriptions should be 
addressed to: 


BIOMETRICS 
Department of Statistics 
The Florida State University 
Tallahassee, Florida 
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The Annals of Mathematical Statistics 


THE OFFICIAL JOURNAL OF THE INSTITUTE OF 
MATHEMATICAL STATISTICS 


Vol. 31, No. 2—June, 1960 


Contents 


Slippage Problems Samuel Karlin and Donald Truax 


Effect on the Minimal Complete Class of Tests of Changes in the Testing Problem 
D. L. Burkholder 


Some Properties of a Class of Bayos Two-Stage Tests Morris Skibinsky 


Lower Bounds for the Expected Sample Size and the Average Risk of a Sequential 
Procedure Wassily Hoeffding 


Sampling Inspection as a Minimum Loss Problem ............ B. L. Van der Waerden 


A Generalized Pitman Efficiency for Non-parametric Tests H. Witting 
Contributions to the Theory of Rank Order Statistics: The Two-sample Censored 

V.R. Rao, I. R. Savage, and M, Sobel 
On a result by M. Rosenblatt concerning the Von Mises-Smirnov Test 
A Sequential Design for the Two Armed Bandit Walter Vogel 
An Asymptotic Minimax Theorem for the Two Armed Bandit Problem ....Walter Vogel 
Elementary Proof of the AEP of Information Theory A. J. Thomasian 
Products of Random Matrices H. Furstenberg and H. Kesten 
An Exponential Bound for Functions of a Markov Chain 

Melvin L. Katz and A. J. Thomasian 
Application of Storage Theory to Queues With Poisson Arrivals N. U. Prabhu 
A One-sided Analog of Kolmogorov’s Inequality Albert W. Marshall 
A One-sided Inequality of the Chebyshev Type ..Albert W. Marshal and Ingram Olkin 
On the Uniqueness of the Triangular Association Scheme 


Proper Spaces Related to Triangular Partially Balanced Incomplete Block Designs 


Notes: 
The First-Passage Moments and the Invariant Measure of a Markov Chain .... 
John Lamperti 
Note on the Distribution of Locally Maximal Elements in a Random Sample 
Contribtuions to the Theory of Rank Order Statistics: Computation Rules for 
Probabilities of Rank Orders Richard Savage 


W adie F. Mikhail 


Correction Notes 
Abstracts of Papers 
News and Notices 
Publications Received 


Address orders for subscriptions and back b to Professor George E. Nichol- 
son, Jr., Secretary, Institute of Mathematical Statistics, Department of Statistics, 
University of North Carolina, Chapel Hill, North Carolina. 
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a Simplified Estimation From Censored Normal Samples ...................W. J. Dixon 
ae) Estimating the Mean of a Finite Population ............-J. Roy and I. M. Chakravarti 
ei A Measure of Preictive Precision in Regression Analysis .................-.H. Linbart 
An inequality for Balanced Incomplete Block Designs ......... 
1 


JOURNAL OF THE ROYAL STATISTICAL SOCIETY 


Series A (General) Vol. 123, 1960 
Ann. Sub. post free, £4 4s Od, U.S. $12 


CONTENTS 
Part 1 


Statistics as a Tool of Management. The Inaugural Address of the President, Sir Hugh 
Beaver, K.B.E. (With Proceedings) . 


Practical Experimentation as a Teaching Method in Statistics. By G. H. Jowett and Hilda 
M. Davies (With Discussion). 


The Critical Community Size for Measles in the United States. By M. S. Bartlett. 
Card-Matching Experiments: a Conspectus of Theory. By N. T. Gridgeman. 
Business Concentration in the United Kingdom. By P. E. Hart. 


Comparative Death Rates per Person-Mile associated with various Forms of Transport, 
1952-57. By F. Garwood and Grace O. Jeffcoate. 


A Tally of Ale. By Evelyn M. Myatt-Price. 
Reviews of Books, Current Notes, Additions to Library. 


Series B (Methodological) Vol. 21, No. 2, 1959 
Ann. Sub. incl. postage, £3 2s 0d, U.S. $9 


CONTENTS 
Control Charts and Stochastic Processes. By G. A. Barnard. (With Discussion). 
Optimum Experimental Designs. By J. Kiefer. (With Discussion). 
A Contribution to the Theory of Bulk Queues. By Rupert G. Miller, Jr. 
Maximising a Function in a Convex Region. By G. Zoutendijk. 


Estimation of Location and Scale Parameters for the Rectangular Population from Cen- 
sored Samples. By A. E. Sarhan and B. G. Greenberg. 


The Distribution of Vacancies on a Line. By E. S. Page. 
The Output Process of the Queueing System M/G/1. By P. D. Finch. 


The Efficiency of Automatic Winding Machines with Constant Patrolling Time. By A. J. 
Howie and L. R. Shenton. 


The Comparison of Regression Variables. By E. J. Williams. 
The Most Economical Sequential Sampling Scheme for Inspection. By G. B. Wetherill. 


Note on Fisher’s Transformation of the Correlation Coefficient. By N. F. Laubscher. 


The Analysis of Experimentally Distributed Life-Times with Two Types of Failures. By 
D. R. Cox. 
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POPULATION STUDIES 


A Journal of Demography 
Edited by D. V. Grass and E. GreBenik 
Vol. XIII, No. 3 CONTENTS March 1960 


JOHN D. DURAND. The Population Statistics of China AD.2—1953. 

LEO SILBERMAN. Hung Liang-Chi, A Chinese Malthus. 

P. J. KOBLENZER and N. H. CARRIER. The Fertility, Mortality and Nupti- 
ality of the Rungus Dusun. 

H. V. MUHSAM. Population Estimates Based on Census Enumeration and Cov- 
erage Check. 


BOOK REVIEWS. 
Subscription price per volume of 3 parts 42s. net, post free 
(or American currency $6.75). 
Single parts £1 each plus postage (American $3.25, post free). 


Published by the POPULATION INVESTIGATION COMMITTEE, at the LONDON SCHOOL 
OF ECONOMICS and POLITICAL SCIENCE, 15 HOUGHTON STREET, LONDON, W.C.2. 


JOURNAL OF BUSINESS 


Graduate School of Business, University of Chicago, Chicago 37, lilinois 


VoLuME XXXIII APRIL 1960 NUMBER 2 


On the Appraisal of Cyclical Turning-Point Predictors Arthur M. Okun 
Prices of Falcon and Corvair Cars in Chicago and Selected Cities Allen F. Jung 
Plan and Equipment Lead Times Thomas Mayer 
Capital Budgeting and Pricing in the French Nationalized Industries Thomas Marschak 
Peak Load Pricing Marcel Boiteux 


Book Reviews 
Books Received 
Notes: University Schools of Business 


The JOURNAL OF BUSINESS is published quarterly by the Ueivers of Chicago Press. Subscrip- 
tions are $6.00 per year and should be addressed to the JOURNAL OF BUSINESS, Graduate School 
of Business, University of Chicago. Manuscripts in duplicate, typed and double- spaced including 
footnotes and quotations), and editorial correspondence should be addressed to Irving Schweiger, 
Editor, JOURNAL OF BUSINESS, at the same address. 
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BIOMETRIKA 


VOL. 47, PARTS 1 & 2 JUNE 1960 
CONTENTS 


Memoirs: 

BARTLETT, M. S., GOWER, J. C. & LESLIE, P. H. A comparison of theoretical and empirical 
results for some stochastic population models. 

KENDALL, David G. Birth-and-death processes, and the theory of carcinogenesis. 

ASHFORD, J. R., SMITH, C. S. & BROWN, Susannah. The quantal response analysis of a series 
of biological assays on the same subject. 

WALKER, A. M. Some consequences of superimposed error in time series analysis. 

FINCH, P. D. Deterministic customer impatience in the queueing systems GI/M/1. 

BARTON, D. E., DAVID, F. N. & FIX, Evelyn. The polykays of the natural numbers. 

SRIVASTAVA, A. B. L.’ The distribution of regression coefficients in samples from bivariate non- 
normal populations I. Theoretical investigation. 

KASTENBAUM, Marvin A. The separation of molecular compounds by countercurrent dialysis: a 
stochastic process. 

SAW, J. G. A note on the error after a number of terms of the David-Johnson series for the ex- 
pected values of normal order statistics. 

WATSON, G. S. More significance tests on the sphere. . 

JOHNSON, N. L. An approximation to the multinomial distribution: some properties and appli- 


cations. 
VAGHOLKAR, M. K. & WETHERILL, G. B. The most economical binomial sequential probability 

ratio test. 
McGREGOR, J. R. An approximate test for serial correlation in polynomial regression. 
BERKSON, Joseph. Nomograms for fitting the agit function by maximum likelihood. 
HAIGHT, Frank A. & BREUER, Melvin Allen. The Borel-Tanner distribution. 
BURR, E. J. The distribution of Kendail’s score S for a pair of tied rankings. 
Contributions by D. H. BHATE, C. R. YTH & C. CURME, C. BURROW. 

. CLUNIES-ROSS & R. H. RIFFENBURGH, D. FINCH, R. G. LAHA & & LUKA 

5 1. “LESLIE, H. D. PATTERSON, B. R. RAO, ce L. TURIN, C. A. WILKINS, M. E. WISE. 
Corrigenda: 
Reviews: 
Other Books received: 
The subscription, payable in advance, is 54/- (or $8.00) per volume (including postage). Cheques 
should be made payable to Biometrika, crossed ''A/c Biometrika Trust’ and sent 3 the Secretary, 
Biometrika Office, University College, London, W.C.1. All foreign cheques must be drawn on a 
Bank having a London agency. 

Issued by THE BIOMETRIKA OFFICE, University College, London 


THE JOURNAL OF FINANCE 


The Journal of THE AMERICAN FINANCE ASSOCIATION 


Vol. XV No. 2 


Financial Policies for Sustainable Growth Honorable Robert B. Anderson 


Gross Flows of Funds through Savings Institutions: Trends and Underlying Factors 
Kenneth M. Wright, Charles M. Torrance, ]. Brooke Willis 


Papers on Consumer Finance Wallace P. Mors, Carl A. Dauten 


Monetary Policy and the Financing Problems of State and Local Governmen 
Dick Frank E, Morris 


Financing Real Estate in the 1960's: Prospects, apie. and Progr: 
Vv. Geonge Pinnll, H Homer Hoyt, Miles L. *Colean 


Outlook for Capital and Money Markets 
Harold Barger, Robert Moore Fisher, Norris O. Johnson, Roy L. Reierson, Beryl W. Sprinkel 


Business Proceedings at A.F.A. Convention 


Membership dues, including $3.00 allocated to subscritpion in The Journal o; of Finance, are $5.00 
annually, Student subscription is $2.00 a year. Libraries may subscribe to The Journal at $5.00 
annually and single copies may be purchased for $1.25. Te for a in the Ameri- 
can Finance Association and subscriptions to The Journ Finance should be addressed to the 
Secretary-Treasurer, George E. Hassett, Graduate School of Business Administration, New York 
University, 100 Trinity Place, New York 6, New York. 


Communication relating to the contents of The Journal of Finance should be addressed to the Editor, 
Joel Segall, Graduate School of Business, University of Chicago, Chicago 37, Illinois, or to the 
+r aeeee Editor, John G. Gurley, Brookings Institution, 722 Jackson Place, N.W., Washington 6, 
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ESTADISTICA 


Journal of the 
Inter American Statistical Institute 


Vor. XVII, No. 65 DeEcEMBER 1959 
CONTENTS 


Sobre. los Dos Aspectos Diferentes del Método Representativo: El Método de Muestreo Estratifi- 
eado y el Método de Seleecion Dirigida (traduccién) Jerzy Neyman 
El Muestreo y la Situacién Actual de la Estadistica Francisco Azorin Poch 
Purposive and Stratified Random Sampling Today Frederick F. Stephan 
Estadisticas y Pardmetros Carlos E. Dieulefait 
La Théorie de l'Estimation et les Sondages ........... P. Thionet 
Some Stratified Sampling Plans in Replicated Designs W. Edwards Deming 
El Problema de la Estratificacién Optima (traduccion) Tore Dalenius 
El Muestreo Estratificado Multip étri (traduccion) ..... $ssnscocesacseoesed Tore Dalenius 
Recent Advances in Sampling Techniques Harry G. Romig 


agem da Safra de Trigo de 1958 no Rio Grande do Sul 
Thomas Jabine, Amaro da Costa Manteiro y Rubens Jorge de Campo 


Legal Provisions. Institute Affairs. Statistical News. Publications. 
Published quarterly Annual subscription price $3.00 (U.S.) 


Inter American Statistical Institute 
Pan American Union, Washington 6, D.C. 


ECONOMETRICA 


Journal of the Econometric Society 


July, 1960, Volume 28, Number 3 


CONTENTS 


JAMES DUSENBERRY, OTTO ECKSTEIN, and GARY FROMM: Stability and Instability 
in the American Economy 


A. H. LAND and A. G. DOIG: An Automatic Method of Solving Discrete Programming 
Problems 


RALPH E. GOMORY and WILLIAM J. BAUMOL: Integer Programming and Pricing 


I. RICHARD SAVAGE and KARL W. DEUTCH: A Statistical Model for the Gross Analysis 
of Transaction Flows 


A. L. NAGAR: A Monte Carlo Study of Alternative Simultaneous Equation Estimators 


GREGORY C, CHOW: Tests of Equality between Sets of Coefficients in Two Linear Regres- 
sions 


LIONEL W. McKENZIE: A Contribution to the Theory of Competitive Equilibrium 


W. V. CANDLER: A Short-Cut Method for the Complete Solution of Game Theory and 
Feed-Mix Problems 


CHARLES W. HOWE: An Alternative Proof of the Existence of General Equilibrium iin a 
von Neumann Model 


KENNETH J. ARROW and LEONID HURWICZ: Some Remarks on the Equilibria of Eco- 


nomic Sys' 
REPORT of the AMSTERDAM MEETING 
REPORT of the WASHINGTON MEETING 


BOOK REVIEWS 
ANNOUNCEMENTS and NOTES 
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AMERICAN ECONOMIC REVIEW 


VoLuME L JUNE 1960 NUMBER 3 


ARTICLES 


An Index of Soviet Industrial Output 

Returns Equalization and Regional Growth 

Monetary Reserves and Credit Control .............0ccceeceeeeeeeeees J. M. Henderson 
The ‘“‘Outer-Seven’’ and European Integration M. E. Kreinin 
Western Development of the Rhodesias Stephen Enke 
The Allocation of Joint Costs A, A. Walters 


COMMUNICATIONS 

A Regional Countermeasure to National Wage Standardization J. M. Buchanan and J. E. Moes 

Population and Economic Growth: 
H. H. Villard 

The Shape of Income Distribution: 
Stanley Lebergott 
The AMERICAN ECONOMIC REVIEW, a quarterly, is the official publication of the 
American Economic Association and is sent to all members. The annual dues are six dollars. 
Address editorial communications to Dr. Bernard F. Haley, Editor, AMERICAN ECONOMIC 
REVIEW, Stanford University, Stanford, California. For information concerning other publi- 
cations and activities of the’ Association, communicate with the Secretary-Treasurer, Dr. 


ames Washington Bell, American Economic Association, Northwestern University, Evanston, 
llinois, Send for information booklet. 


JOURNAL OF FARM ECONOMICS 
Published by 
THE AMERICAN FARM ECONOMIC ASSOCIATION 


Editor: Herman M. SourHwortH 
The Pennsylvania State University, University Park, Pennsylvania 


Volume XLII May, 1960 


Programming Smal] Watershed 
George A. Pavelis and John F. Timmons 
Evaluating a Farm Machine Prior to Its 
W. D. Toussaint and P. S. Stone 
The Economic Rugest ot Technical Assistance: a Brazilian Case Study 
Clifton R. Wharton, Jr. 
Estimates of the Aggregate U. S. Farm Supply Function . Zvi Griliches 
Land Substitutes and Changes in Corn Yields ..............-- Paul R. Johnson 
The Influence of “Impulse won or In-The-Store Decisions on Consumers’ 
Proceedings Papers, Joint Meetings of the American Farm Economic Associa- 
tion and Allied Social Science Associations 
Additional articles, notes, book reviews, and announcement of new bulletins and 
other publications in agricultural economics. 
Published in February, May, August, November, and December. $9 yearly. 


Secretary-Treasurer: C. Del Mar Kearl 
Department of Agricultural Economics 
Cornell University, Ithaca, New York 
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CHAPTER PRESIDENTS 


Herbert L. Bryan, Director of Research and Stat., New York State 
.* of Correction, 39 Columbia Street, Albany, New York 

N. K. Woerner, Dept. of Public Safety, North Austin Station, 
Austin, Texas 

Frederick Mosteller, Dept. of Statistics, Harvard University, 2 
Divinity Ave., Rm. 311, Cambridge 38, Massachusetts 

Richard N. Schmidt, School of Business Adm., University of 
Buffalo, Buffalo, N. Y. 

John R. Virts, 6143 Meadowlark Drive, Indianapolis 26, Indiana 

Robert J. Buehler, Statistical Lab., Iowa State College, Ames, Iowa 

Mrs. Gladys W. Ellsworth, Research & Statistics, N. J. State Dept. 
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Pearl A. Van Natta, Child Research Council, School of Med., Uni- 
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Robert B. Miner, Dept. of Business Organization, Ohio State Uni- 
versity, Columbus, Ohio 

Hilary L. Seal, > of Osborn Zoological Laboratory, 
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